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(54) Title: GROUP B STREPTOCOCCUS VACCINE 

(57) Abstract: This application relates to improved Group B Streptococcus ("GBS") saccharide-based vaccines comprising com- 
binations of GBS polysaccharides with polypeptide antigens, and vice versa, such that the polypeptide and the saccharide each 
contribute to the immunological response in a recipient. The combination is particularly advantageous where the saccharide and 
polypeptide are from different GBS serotypes. The combined antigens may be present as a simple combination where separate 
saccharide and polypeptide antigens are administered together, or they may be present as a conjugated combination, where the sac- 
charide and polypeptide antigens are covalently linked to each other. Preferably, the immunogenic compositions of the invention 
comprise a GBS saccharide antigen and at least two GBS polypeptide antigens, wherein said GBS saccharide antigen comprises a 
saccharide selected form GBS serotype la, lb, and III, and wherein said GBS polypeptide antigens comprise a combination of at least 
two polypeptide or fragments thereof selected from the antigen group consisting of GBS 80, GBS 91, GBS 104, GBS 147, GBS 173, 
GBS 276, GBS 305, GBS 313, GBS 322, GBS 328, GBS 330, GBS 338, GBS 358, GBS 361, GBS 404, GBS 656, GBS 690, and 
GBS 691. 
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GROUP B STREPTOCOCCUS VACCINE 

5 

This application claims the benefit of U.S. Provisional Patent Application Serial No. 
60/410,839, filed September 13, 2002, which application is incorporated herein by reference in its 
entirety. 

TECHNICAL FIELD 

10 This invention relates to polysaccharides from the bacteria Streptococcus agalactiae (GBS) 

and to their use in immunisation. 
BACKGROUND ART 

Once thought to infect only cows, the Gram-positive bacterium Streptococcus agalactiae (or 
"group B streptococcus", abbreviated to "GBS" (Ref. 1) is now known to cause serious disease, 

15 bacteremia and meningitis, in immunocompromised individuals and in neonates. There are two . 
types of neonatal infection. The first (early onset, usually within 5 days of birth) is manifested by 
bacteremia and pneumonia. It is contracted vertically as a baby passes through the birth canal. GBS 
colonises the vagina of about 25% of young women, and approximately 1% of infants born via a 
vaginal birth to colonised mothers will become infected. Mortality is between 50-70%. The second 

20 is a meningitis that occurs 10 to 60 days after birth. If pregnant women are vaccinated with type III 
capsule so that the infants are passively immunised, the incidence of the late onset meningitis is 
reduced but is not entirely eliminated. 

The "B" in "GBS" refers to the Lancefield classification, which is based on the antigenicity 
of a carbohydrate which is soluble in dilute acid and called the C carbohydrate. Lancefield identified 

25 13 types of C carbohydrate, designated A to O, that could be serologically differentiated. The 

organisms that most commonly infect humans are found in groups A, B, D, and G. Within group B, 
strains can be divided into at least 9 serotypes (la, lb, Ia/c, II, III, IV, V, VI, VII and VIII) based on 
the structure of their polysaccharide capsule. In the past, serotypes la, lb, II, and III were equally 
prevalent in normal vaginal carriage and early onset sepsis in newborns. Type V GBS has emerged 

30 as an important cause of GBS infection in the USA, however, and strains of types VI and VIII have 
become prevalent among Japanese women. 

The genome sequence of a serotype V strain 2603 V/R has been published (Ref. 2) and 
various polypeptides for use a vaccine antigens have been identified (Ref. 3). The vaccines currently 
in clinical trials, however, are based on polysaccharide antigens. These suffer from serotype- 

35 specificity and poor immunogenicity, and so there is a need for effective vaccines against 
S. agalactiae infection. 
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It is an object of the invention to provide further and improved GBS vaccines. 

DISCLOSURE OF THE INVENTION 

The inventors have realised that saccharide-based vaccines can be improved by using them in 
5 combination with polypeptide antigens, and vice versa, such that the polypeptide and the saccharide 
each contribute to the immunological response in a recipient. The combination is particularly 
advantageous where the saccharide and polypeptide are from different GBS serotypes. 

The combined antigens may be present as a simple combination where separate saccharide 
and polypeptide antigens are administered together, or they may be present as a conjugated 
10 combination, where the saccharide and polypeptide antigens are covalently linked to each other. 

Thus the invention provides an immunogenic composition comprising (i) one or more GBS 
polypeptide antigens and (ii) one or more GBS saccharide antigens. The polypeptide and the 
polysaccharide may advantageously be covalently linked to each other to form a conjugate. 

Between them, the combined polypeptide and saccharide antigens preferably cover two or 
15 more GBS serotypes {e.g. 2, 3, 4, 5, 6, 7, 8 or more serotypes). The serotypes of the polypeptide and 
saccharide antigens may or may not overlap. For example, the polypeptide might protect against 
serogroup II or V, while the saccharide protects against either serogroups la, lb, or HI. Preferred 
combinations protect against the following groups of serotypes: (1) serotypes la and lb, (2) serotypes 
la and II, (3) serotypes la and III, (4) serotypes la and IV, (5) serotypes la and V, (6) serotypes la and 
20 VI, (7) serotypes la and VII, (8) serotypes la and VIII, (9) serotypes lb and n, (10) serotypes lb and 
III, (11) serotypes lb and IV, (12) serotypes lb and V, (13) serotypes lb and VI, (14) serotypes lb and 
VII, (15) serotypes lb and VIII, 16) serotypes II and III, (17) serotypes II and IV, (18) serotypes II 
and V, (19) serotypes II and VI, (20) serotypes II and VII, (21) serotypes II and VII, (22) serotypes 
III and IV, (23) serotypes III and V, (24) serotypes III and VI, (25) serotypes III and VII, (26) 
25 serotypes III and VIII, (27) serotypes IV and V, (28) serotypes IV and VI, (29) serotypes IV and VII, 
(30) serotypes IV and VIII, (31) serotypes V and VI, (32) serotypes V and VII, (33) serotypes V and 
VHI, (34) serotypes VI and VII, (35) serotypes VI and VIII, and (36) serotypes VII and VEIL 

Still more preferably, the combinations protect against the following groups of serotypes: (1) 
serotypes la and II, (2) serotypes la and V, (3) serotypes lb and II, (4) serotypes lb and V, (5) 
30 serotypes III and II, and (6) serotypes III and V. Most preferably, the combinations protect against 
serotypes III and V. 

Protection against serotypes II and V is preferably provided by polypeptide antigens. 
Protection against serotypes la, lb and/or EI may be polypeptide or saccharide antigens. 

Preferably, the immunogenic composition comprises one or more serogroup V antigens or 
35 fragments thereof selected from the antigen group consisting of GBS 80, GBS 91, GBS 104, GBS 
147, GBS 173, GBS 276, GBS 305, GBS 313, GBS 322, GBS 328, GBS 330, GBS 338, GBS 358, 
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GBS 361, GBS 404, GBS 656, GBS 690, and GBS 691. Preferably, the composition comprises a 
composition of at least two of these GBS antigens or a fragment thereof. 

In one embodiment, the immunogenic composition comprises a GBS saccharide antigen and 
at least two GBS polypeptide antigens or fragments thereof, wherein said GBS saccharide antigen 
5 comprises a saccharide selected from GBS serotype la, lb, and III, and wherein said GBS 

polypeptide antigens comprise a combination of at least two polypeptide or a fragment thereof 
selected from the antigen group consisting of GBS 80, GBS 91, GBS 104, GBS 147, GBS 173, GBS 
276, GBS 305, GBS 313, GBS 322, GBS 328, GBS 330, GBS 338, GBS 358, GBS 361, GBS 404, 
GBS 656, GBS 690, and GBS 691 . 
10 Preferably, the combination comprises GBS 80 or a fragment thereof. In one embodiment, 

the GBS polypeptide antigens comprise a combination of two GBS antigens or fragments thereof 
selected from the antigen group consisting of (1) GBS 80 and GBS 91, (2) GBS 80 and GBS 104, (3) 
GBS 80 and GBS 147, (4) GBS 80 and GBS 173, (5) GBS 80 and GBS 276, (6) GBS 80 and GBS 
305, (7) GBS 80 and GBS 313, (8) GBS 80 and GBS 322, (9) GBS 80 and GBS 328, (10) GBS 80 
15 and GBS 330, (11) GBS 80 and GBS 338, (12) GBS 80 and GBS 358, (13) GBS 80 and GBS 361, 
(14) GBS 80 and GBS 404, (14) GBS 80 and GBS 404, (15) GBS 80 and GBS 656, (16) GBS 80 and 
GBS 690, and (17) GBS 80 and GBS 691. 

Still more preferably, the combination is selected from the antigen group consisting of (1) 
GBS 80 and GBS 338; (2) GBS 80 and GBS 361, (3) GBS 80 and GBS 305, (4) GBS 80 and GBS 
20 328, (5) GBS 80 and GBS 690, (6) GBS 80 and GBS 691 and (7) GBS 80 and GBS 147. Even more 
preferably, the combination comprises GBS 80 and GBS 691. 

In one embodiment, the composition comprises a combination at least three GBS polypeptide 
antigens. Preferably, this combination comprises GBS 80 and GBS 691. 

Preferably, the immunogenic composition further comprises a GBS polypeptide or a 
25 fragment thereof of serogroup II. 
The polypeptide antigen 

The polypeptide is preferably: (a) a polypeptide comprising an amino acid sequence selected 
from the group consisting of the even-numbered SEQ IDs 2-10966 from Ref. 3; (b) a polypeptide 
comprising an amino acid sequence having sequence identity to an amino acid sequence from in (a); 
30 or (c) a polypeptide comprising a fragment of an amino acid sequence from (a). 

Within (a), preferred SEQ IDs are those which encode GBS1 to GBS689 (see Table IV of 
reference 3). 

Within (b), the degree of sequence identity may vary depending on the amino acid sequence 
(a) in question, but is preferably greater than 50% (e.g. 60%, 70%, 80%, 90%, 95%, 99% or more). 
35 Polypeptides within (b) include homologs, orthologs, allelic variants and functional mutants of (a). 
Typically, 50% identity or more between two proteins is considered to be an indication of functional 
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equivalence. Identity between proteins is preferably determined by the Smith-Waterman homology 
search algorithm as implemented in the MPSRCH program (Oxford Molecular), using an affine gap 
search with parameters gap open penalty-12 and gap extension penalty =L 

Within (c), the length of the fragment may vary depending on the amino acid sequence (a) in 
5 question, but the fragment is preferably at least 7 consecutive amino acids from the sequences of (a) 
e.g. 8, 10, 12, 14, 16, 18, 20, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200 or more. Preferably the 
fragment comprises one or more epitopes from the sequence. Other preferred fragments are the 
N-terminal signal peptides of SEQ IDs 1-10966 from Ref. 3, SEQ IDs 1-10966 from Ref. 3 without 
their N-terminal signal peptides, and SEQ IDs 1-10966 from Ref. 3 wherein up to 10 amino acid 

10 residues (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 residues) are deleted from the N-terminus and/or the 
C-terminus e.g. the N-terminal amino acid residue may be deleted. 

The polypeptides can, of course, be prepared by various means (e.g. recombinant expression, 
purification from GBS, chemical synthesis etc.) and in various forms (e.g. native, fusions, 
glycosylated, non-glycosylated etc.). They are preferably prepared in substantially pure form (i.e. 

15 substantially free from other streptococcal or host cell proteins) or substantially isolated form. 

Preferred polypeptide antigens are: GBS 80, GBS 91, GBS 104, GBS 147, GBS 173, GBS 
276, GBS 305, GBS 313, GBS 322, GBS 328, GBS 330, GBS 338, GBS 358, GBS 361, GBS 404, 
GBS 656, GBS 690, and GBS 691, including polypeptides having amino acid sequences with 
sequence identity thereto etc. 

20 The nucleotide and amino acid sequences of GBS80 in Ref. 3 are SEQ ID 8779 and SEQ ID 

8780. These sequences are set forth below as SEQ ID NOS 1 and 2: 
SEQ ID NO. 1 

ATGAAATTATCGAAGAAGTTATTGTTTTCGGCTGCTGTT^ 
GACTGGAATGAGTATTGTAAGAGCTGCAGAAGTGTCACAAGAACGCCC^ 
25 ATAGTTATAAATCGGAAATTACTTCTAATGGTGGTATCGAGAATAAAGACGGCGAAGTAATATCTAACTATGCTAAACTTGGTGAC 
AATGTAAAAGGTTTGCAAGGTGTACAGTTTAAACGTTAT^ 

AGTTGAAGCAGCAGATGCAAAAGTTGGAACGATTCTTGAAGAAGGTGTCAGTCTACCTCAAAAAA 

TCGATGCTCTGGATTCAAAAAGTAATGTGAGATACTTGTATGTAGAAGAT TTAAAGAATT CAC CTT CAAACATTAC CAAAGCTTAT 
GCTGTACCGTTTGTGTTGGAATTAC(^GTTGCTAACTCTACAGGTA(^GGTTTCCTTTCTGAAATTAATATTTACCCTAAAAACGT 
30 TGTAAC TGATGAACCAAAAACAGATAAAGATGT TAAAAAATTAGGT CAGGACGATGCAGGTTATACGATTGGTGAAGAATTCAAAT 
GGTTCTTGAAATCTACAATCCCTGCCAATTTAGGTGACTATGAAAAATTTGAAATTACTGATAAATTTGCAGATGGCTTGACTTAT 
AAATCTGTTGGAAAAATCAAGATTGGTTCGAAAACACTGAATA 

AAATACATTAAAAATTACGTTTAAACCAGAGAAATTTAAAGAAATTGCTGAGCTACTTAAAGGAATG 
ATGCTCTTGATAAAGCTACTGCAAATAGAGATGATGCGGCATTTTTGGAAATTCCAGTTGCATCAACTA 
35 TTAGGAAAAGCAATTGAAAATACTTTTGAACTTCAATATGACCATACTCCTGATAAAGCTGACAAT 
AAGAAAACCAGAAGTTCATACTGGTGGGAAACGATTTGTAAAGAAA 

ATTTGTTGGCTTCTGATGGGACAGmGTAAAATGGACAGATGCTCTTATTAAAGCGAATACTAATAAAAACTATATTGCTGGAG^ 
GCTGTTACTGGGCAACCAATCAAATTGAAATCACATACAGACGGTACG 
TGC^GAGGGTA^GCAGTAACTTACAAATTAAAAGAAA 
40 CAGTATCACAAACATCTTATAATACAAAACCAAC^ 

AAACGTCCTTCAATCCCTAATACTGGTGGTATTGGTACGGCTATCTTTGTCGCTATCGGTGCTGCGGTGATGGCTTTTGCTGTTAA 
GGGGATGAAGCGT CGTACAAAAGATAAC 

SEQ ID NO: 2 

45 MKLSKXIjLFSAAVLTMVAGS TVEPVAQFATGMS I VRAAEVSQERPAKTTVNI YKLQADS YKSE ITSNGGI ENKDGEVI SNYAKLGD 
NVKGLQGVQFKRYKVKTDISVDELKKLTOT 
AVPFVLELPVANSTGTGFLSEINIYPK2WVTDEPKTO 

KS VGKI KI GS KTLNRDEHYTIDEPTVDNQNTLKI TFKPEKFKEIAELLKGMTLVKNQDALDKATANTDDAAFLEI PVASTINEKAV 
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LGKAIENTFELQYDHTPDKADNPKPSNPPR 
AVTGQPIKLKSHTDGTFEIKGIAYAVDANAEGTATO 
KRPS I PNTGG I GTAI F VAI GAAVMAFAVKGMKRRTKDN 

5 The nucleotide and amino acid sequences of GBS 91 in Ref. 3 are SEQ ID 8937 and SEQ ID 

8938. These sequences are set forth below as SEQ ID NOS 3 and 4: 
SEQ ID NO. 3 

ATGAAAAAAGGACAAGTAAATGATAC TAAG CAATCTTACTCTCTACGTAAATATAAATTTGGTTTAGCAT CAGTAATTTTAGGGTC 
ATT CATAATGGTCACAAGTC CTGTT T TTGCGGATCAAACTACATCGGTTCAAGTTAATAAT CAGACAGGCACTAGTGTGGATGCTA 
10 ATAATTCTTCCAATGAGACAAGTGCGTC^GTGTGATT 

CAAAATACGGCAACAAAGGACATTACTACTCCTTTAGTAGAGACAAAGCCAATG^ 
TGTTTATAGCAAAGAAACCGAGGTGAAAAATACACCTTCAAAATCAGC^ 

TCTATGACCAAGTATTTAATAAAGATAATGTGAAATGGATTTCATATAAGTCTTTTTGTGGCGTACGTCGATACGCAGCTATTGAG 
TCACTAGATC CAT CAGGAGGTTCAGAGACTAAAGCAC CTACTC CTGTAACJAAAT^ CAAGAGAAAATAGCAAC 
1 5 GCAAGGAAATTATACATTTTCACATAAAGTAGAAGTAAAAAATGAAGCTAAGGTAGCGAGTCCAACTC^ 

GAGACAGAAT TTTTTACGAC CAAATACTAACTATTGAAGGAAATCAGTGGTTATCTTATAAAT CATTCAATGGTGT T CGT CGT TTT 
GTTTTGCTAGGTAAAGCATCTTCAGTAGAAAAAACTGAAGATAAAGAAAAAGTGTC 

TGGTAGACTGACTATTTCTAACGAAACAACTACAGGTTTTGATATTTTAATTACGAATATTAAAGATGATAACGGTATCGCTGCTG 
TTAAGGTACCGGTTTGGACTGAACMGGAGGGCAAGATGATATTAAATGGTATACAGCTGTAACTACTGGGG^ 

20 GTAGCTGTATCATTTGCTGACCATAAGAATGAGAAGGGTCTTTATAATATTC^ 

AGGTGTAACAGGAACTAAAGTGACAGTAGCTGGAACTAATTCTTCTCAAGAACCTATTGAAAATGGTTTAGCAAAGACTGGTGTTT 
ATAATATTATCGGAAGTACTGAAGTAAAAAATGAAGCTAAAATATCAAGTCAGACCCAATTTACTTTAGAAAAAG^ 
AATTATGATCAAGTATTGACAGCAGATGGTTACCAGTGGATTTCTTACAAATCTTATAGTGGTGTTCGTCGCT 
AAAGCTAACTACAAGTAGTGAAAAAGCGAAAGATGAGGCGACTAAACCGACTAGTTATCCCAACTTACCTAAAACAGGTACCTATA 

25 CATTTACTAAAACTGTAGATGTGAAAAGTCAACCTAAAGTATCMGTCCAGTGGAATTTAATTTTCAAAAGGGTG 

TATGAT CAAGTGT TAGTAGTAGATGGTCAT CAGTGGATTTCATACAAGAGTTATTC CGGTATT CGTCGCTATATTGAAATT 

SEQ ID NO. 4 

MKKGQVNDTKQSYSLRKYKFGLASVILGSFIMVTSPVFAD^ 
30 QNTATKD I TTPLVETKPMVE KTLPEQGNYV YS KETE VKNTP S KS AP VAF YAKKGDKVF YDQVFNKDNVKW I S YKS F CGVRRYAAI E 
SLDPSGGSETKAPTPVTNSGSNNQEKIATQGNYTFSHKVEV^ 
VLLGKASSVEKTEDKEKVSPQPQARITKTGRLT 

VAVS FADHKNEKGL YNIHLYYQEASGTLVGVTGTKVTVAGTNS SQEP I ENGLAKTGVYNI I GS TEVKNEAKI S SQTQFTLEKGDKI 
ITYDQVLTADGYQWISYKSYSGVRRYIPVKX^ 
35 YDQVLWDGHQWI SYKSYSGIRRYIEI 

The nucleotide and amino acid sequences of GBS 104 in Ref. 3 are SEQ ID 8777 and SEQ 
ID 8778. These sequences are set forth below as SEQ ID NOS 5 and 6: 
SEQ ID NO. 5 

40 ATGAAAAAGAGACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAAT CCTGT C CCAAATT C CATTTGGTATAT TGGTACAAGG 
TGAAkCCCAAGATACCAATCAAGCACTTGGAA?^GTAATTGTTAAAAA 

TTGTGT TAAAAAATGACAATGATAAGTCAGAAACAAGTCACGAAACGGTAGAGGGTTCTGGAGAAG CAACCTT TGAAAACATAAAA 
CCTGGAGACTACACATTAAGAGAAGAAACAGCACCAATT^ 

CGGAGCAACAATAATCGAGGGTATGGATGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGCCCAAT^ 

45 TTTATGAGGATACAAAAGAAAATTACCCATTAGTTAATGTAGAGGGTTCCAA^ 

AATGGAAAAGATGGT CGAAGAGAGAT TGCTGAAGGTTGGTTAT CAAAAAAAAT TACAGGGGT CAATGATCTCGATAAGAATAAATA 
TAAAATTGAATTAACTGTTGAGGGTAAAACCACTGTTGAAACGAAAGAACTT^ 
ATTCAAATAGTATGAATAATGAAAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTG^ 
ATTACATCAAATAAAGACAATAGAGTAGCTCTTGTGACATATGCCTC^ 

50 AGTTGCCGATCAAAATGGTAAA.GCGCTGAATGATAGTGTATCATGG 

ACAGTTATTTAAATTTAACAAATGATGCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAAGGAAGCGGAG 
GATCGCACGCTCTATCAATTTGGTGCGACATTTACTCAAAAAGCTCTAATGAAAGCAAATGAAATTTTAGAGACACAAAGTTCT^ 
TGCTAGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTACGATGTCTTATGCCATAAATTTTAATCCTTATATATCAACAT 
CT TACCAAAAC CAGTT TAATT CTTTTTTAAATAAAATACCAGATAGAAGTGGTATTCTCCAAGAGGAT T TTATAAT CAATGGTGAT 

55 GATTAT CAAATAGTAAAAGGAGATGGAGAGAGT TTTAAACTGT T TTCGGATAGAAAAGTT CCTGTTACTGGAGGAACGACACAAGC 
AGCTTATCGAGTACCGCAAAATCAACTCTCTGTAATGAGTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTCTATTGGA 
GAGATTACAACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCTGCAACGAAACAAATCAAAA 
ACATTATACTTTAATGGAAATATAAGACCTAAAGGTTATGAC^TTTTTACTGTTGGGATTGGTGTAAACGGAGATCCTGGT^ 
TC CTCTTGAAGCTGAGAAATTTATGCAAT CAATAT CAAGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATG 

60 ATGAGCTAAATAAATACTTTAAAACAATTGTTGAGGAAAAACATTCTATTGTTGATGGAAATGTGACTGAT 

ATTGAATTCCAATTAAAAAATGGTCAAAGTTTTACACATGATGATTACGTTTTGGTTGGAAATGATGGCAGTCAAT^ 
TGTGGCTCTTGGTGGACCAAACAGTGATGGGGGAATTTTAAA&GATGTTACAGTGACCT 
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TCAATCATTTGAACTTAGGAAGTGGACAAAAAGTAGTTCT 

TACAATACAAATAATCGTACAACGCTAAGTCCGAAGAGTGAAAAAGAACCAAATACTATTCGTGATTTC C CAATT C CCAAAATTCG 
TGATGTTCGTGAGTTTCCGGTACTAACCATCAGT^ 
ATTCAGAATCGCTTTTGGGAGCTAAGTTTCAACTTC^ 
5 GATGTTACAACAAAGAATGATGGTAAAATTTATTTTA^ 

TGGCTATATAGAGGT TAAAACGAAACCTGT TGTGACATTTACAATT CAAAATGGAGAAGTTACGAACC TGAAAGCAGAT C CAAATG 
CTAATAAAAATCAAATCGGGTATCTTGAAGGAAATGGTAAACATCTTATTACCAACACTCCCAAA 

AAAACAGGGGGAATTGGTACAATTGTCTATATATTAGTTGGTTCTACTTTTATGATACTTACCATTTGTTCTTTCCGTCGTAAACA 
ATTG 

10 

SEQ ID NO. 6 

MKKRQKIWRGLSVTLLILSQIPFGILVQGET^ 
PGDYTLREETAPIGYKKTDKTWKVKVADNGATI^ 

NGKDGRRE I AEGWLSKKI TGVMDLDKNKYKI ELTVEGKT WETKELNQPLDWVLLDNSNSMNNERAIOTJS QRALKAGEAVEKLIDK 

15 ITSNKDNRVALVTYASTIFDGTEATVSKGV 

DRTLYQFGATFTQKALMKANEIIiETQS SNARKKLIFHVTDGVPTMS YAINFNPYI STS YQNQFNS FLNKI PDRSGI LQEDFI INGD 
DYQIVKGDGESFKLFSDRK^PVTGGTTQAAYRVPQNQLSVMSNEGYAIITSGYIYLYWRDYNWYPFDPKTKKVSATKQIKTHGEPT 
TLYFNGNI RPKGYDI FTVGI GVNGDPGATPLEAEKFMQS I S SKTENYTNVDDTNKI YDELiNKYFKTI VEEKHS I VDGNVTDPMGEM 
IEFQLKNGQS FTHDDYVLVGNDGSQLKNGVAIjGGPNSDGGI LKDVTVTYDKTSQTI KINHLNLGSGQKVVLTYDVRLKDNYI SNKF 

20 YNTJtntfRTTLSPKSEKEPNTIRD^ 

DVTTKNDGKI YFKALQDGNYKL YEI SSPDGYIEVKTKPVVTFTIQNGEVTl^KADPNANKNQI GYLEGNGKHL I TNTPKRPPGVFP 
KTGGIGTIVYILVGSTFMILTICSFRRKQL 

The nucleotide and amino acid sequences of GBS 147 in Ref. 3 are SEQ ID 8525 and SEQ 
25 ID 8526. These sequences are set forth below as SEQ ID NOS 7 and 8: 
SEQ ID NO. 7 

GTGGATAAACATCACTCAAAAAAGGCTATTTTAAAGTTAACA 

CAAGAATTAAAAAACCAAGAGCAATCACCTGTAAT^ 

GTAAGAGCTGCTTCTGCTAGTAATACAGCGAAAGAAAT^ 
30 AAAAACCTTGATACGTCTAATTTGGGGGCTGATCTT 

AATGCTTCAACTGCAATAGCACAGAAAGTTCCCTCAGCAT^ 

ATAACAAAATTACAAGCGATAACCCAAAGAGGAAAGGG 

GATAGCCCAAAAGATGATAAGCACAGCTTTAAAACTAAGACAGAATTT 

GATAAGATTGTTTTTGCACATAACTAC 
35 AATATTTCGCATGGTACACACGTTGCTGGTATTTTT^ 

GCTCAAGTCTTATTAATGCGTATTCCAGATAAAATTGATTC 

GCAAAAACGATTAATATGAGTATTGGAAAAACAGCTC 

GGCGTTGCAGTTGTTGTGGCTGCCGGAAATGAAGGCGCATTTGGTATGGATTATAGCAAACCATTATCAACTAATCCTGACTACGGTACGGTTAAT 

AGTCCAGCTATTTCTGAAGATACTTTGAGTGTTGCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGAAACAACTATTGAAGGTAAGTTA 
40 GTTAAGTTGCCGATTGTGACTTCTAAACCTTTTGACAAAGGTAAGGCCT 

AAGGACTTTAAAGGTAAGATTG CATTAATTGAG CGTGGTGGTGGACTTGATTTTATGACTAAAATCACTCATG CTACAAATGCAGGTGTTGTTGGT 

ATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATTTTCTAATTCCTTACCGTGAATTACCTGTGGGGATTATTAGTAAAGTAGATGGCGAGCGT 

ATAAAAAATACTTCAAGTCAGTTAACATTTAACCAGAGT^ 

GTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCT 
45 ACAAGTATGGCTTCACCACATGTTGCAGGATTAATGACAATGCTTCAAAGTCATTTGGCTGAGAAATATAAAGGGA 

TTGCTAGAATTGTCTAAAAACATCCTCATGAGCTCAGCAAC^^ 

GGTGTAGTTGATGCTGAAAAAGCTATCCAAGCTCAATATTATATTACTGGAAACGATGGCAAAGCTAAAATTAATCTCAAACGAATGGGAGATAAA 
TTTGATATCACAGTTACAATTCATAAACTTGTAGAAGGTGTCAAAGAATTGTATTATCAAGCTAATGTAGC^ 

TTTGCCCTTAAACCACAAGCCTTGCTAGATACTAATTGGCAGAAAGTAATTCTTCGTGATAAAGAAACACAAGTTCGATTTACTATTGATGCTAGT 
50 CAATTTAGTCAGAAATTAAAAGAACAGATGGCAAATGGTTATTT^ 

ATGAGTATTCCTTTTGTAGGATTTAATGGTGATTTTGCGAACTTACAAGCACTTGAAACACCGATTTATAAGACGCTTTCTAAAGGTAGTTTCTAC 
TATAAACCAAATGATACAACTCATAAAGACCAATTGGAGTAC^ 

GCGTCTTGGGGCTATGTTGATTATGTCAAAAATGGTGGGGAGTTAGAATTAGCACCGGAGAGTCCAAAAAGAATTA 
AAGGTTGAGGATAAAACAATTGATCTTTTGGAAAGAGATGCAGC^ 
55 ATC^CTCCCCAGGCAACTTTCTTAAGAAATGTTAAGGATATTT^ 
CCATCTTATCGTAAAAATTTCCATAATAATCCAA^ 
AAAGTTGTAGCAGATGGTTTTTATACTTATCGCTTACGTTACAC 

AGTACTAAGTCACCAAATCTTCCTTCACGAGCTCAGTTTGATGAAACTAATCGAACATTAAGCTTAGCCATGCCTAAGGAAAGTAGTTATGTTCCT 
ACATATCGTTTACAATTAGTTTTATCTCATGTTG^ 
60 AAAGTGACACTTCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCGGTA 
TTCGCAACGGTAAAATTGTCTGATCTCTTGAATAAGGCAGTAGTA^ 

AACTTGAAAAAAGAACCTATGTTTATTTCTAAAAAAGAAAAAGTAGTAAACAAGAATCTAGAAGAAATAATATTAG 
ACTACTCAATCATTGTCTAAAGAAATAACTAAATCAGGAAATGAGAAAGT 
TCACCTAAACATAACGGGGATTCTGTTAACCATACCTTACCT 
65 TCTAGTTTACTTCTTTATTTGAAACCCAAAAAGACTAAAAATAATAGTAAA 

SEQ ID NO. 8 

VDKHHSKKAI LKLTL ITTS I LLMHSNQVNAEEQELKNQEQSPVI ANVAQQPS PS VTTNTVEKTSVTAASASNTAKEMGDTSVKNDKTEDELLEELS 
KNLDTSNLGADLEEEYPSKPETTNNKESNWTNASTAI AQKVPSAYEEVKPESKSSLAVLDTSKITKLQAITQRGKGNWAI IDTGFD INHDI FRL 
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DSPKDDKHSFKTKTEFEELKAKHNITYGKWVNDK^ 

AQVLLMRI PDKIDSDKFGEAYAKAI TDAVNLGAKT INMS I GKTADSLIALNDKVKIALKLASEKGVAVVW 

SPAI SEDTL.S VAS YESLKT I SEWETT I EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKDFEGKDFKGKIAL I ERGGGLDFMTKITHATNAG WG 
I VI FNDQEKRGNFL I PYRELPVGI I SKVDGERI KNTS SQLTFNQSFEWDSQGGNRMLEQSSWGVTAEGAI KPDVTASGFE I YSSTYNNQYQTMSG 
5 TSMAS PHVAGLMTMLQSHLAEKYKGMNIjDSKKLLiELSKNI LMSSATALYSEEDKAFYSPRQQGAGVVDAEKAI QAQYYITGNDGKAKINLKRMGDK 
FD I TVT I HKLVEGVKEL Y YQANVATEQVNKGKFAL KP QALLDTNWQKVI LRDKETQVRFT I DAS QFS QKLKEQMANG YFLEGFVRFKEAKDSNQEL 
MSIPFVGFNGDFAJSTLQALETPIYKTLSKGSFYYK^ 

KVEDKTI HLLERDAANNPYFAI SPNKDGNRDE ITPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS YRKNFHNNPKQSDGHYRMDALQWSGLDKDG 
KVVADGFYTYRLRYTPVAEGANSQESDFKVQVSTKSPNIjPSRAQ 
1 0 KVTLPiCr VKI GE S EVAVDPKALTLVVEDKAGNFAT VKL SDLiLNKA WS E KENAI V I SNS FKYFDNLKKEPMF I S KKE KWNKNLEE 1 1 LVKPQTT V 
TTQSLSKE ITKSGNEKVLTSTNNNS SRVAKI I S PKHNGDS VNHTLPSTSDRATNGLFVGTIALLS SLLLYLKPKKTKNNSK 

The nucleotide and amino acid sequences of GBS 173 in Ref. 3 are SEQ ID 8787 and SEQ 
ED 8788. These sequences are set forth below as SEQ ID NOS 9 and 10: 
15 SEQ ID NO. 9 

ATGAAACGTAAATACTTTATTCTTAATACGGTGACG 

AGTGCTTCAGTAGTTCCTACTACAAATACTATCGTTCAAACTAATGACAGTAATCCTACCGCAAAATTTGTATCAGAATCAGGACAATCTGTAATA 
GGTCAAGTAAAACCAGATAATTCTGCGGCGCTTACAACAGTTGACAC 

CCTGTCGTTGAGAGTACTTCTACTAAGTTAACTGAAGAGACTTACAAACAAAAAGATGGTCAAGATTTAGCCAAGATGGTGAGAAGTGGTCAAGTT 
20 ACTAGTGAGGAACTCGTTAATATGGCATACGATATTATTGCn'AAAGAAAACCCATCTTTAAATGCAGTCATTACTACTAGACGCCAAGAAGCTATT 
GAAGAGGCTAGAAAACTTAAAGATAC<2AATCAGCCGTTTTTAGGT^ 
AATAATGGCTTGATCTATGCAGATGGAAAAATTAGCACATTTC 

ACGAACTTTCCAGAGTATGGGTGGCGTAATATAACAGATTCTAAATTATACGGTCTAACG CATAATC CTTGGGATCTTGCTCATAATG CTGGTGG C 
TCTTCTGGTGGAAGTGCAGCAGCCATTGCTAGCGGAATGACGCCAATTGCTAGCGGTAGTGATGCTGGTGGTTCTATCCGTATTCCATCTTCTTGG 

25 ACGGGCTTGGTAGGTTTAAAACCAACAAGAGGATTGGTGAGTAATGAAAAGCCAGATTCGTATAGTACAGCAGTTCATTTTCCATTAACTAAGTCA 
TCTAGAGACGCAGAAACATTATTAACTTATCTAAAGAAAAGCGATCAAACGCTAGTATCAGTTAATGATTTAAAATCTTTACCAATTGCTTATACT 
TTGAAATCACCAATGGGAACAGAAGTTAGTCAAGATGCTAAAAACGCTATTATGGACAACGTCACATTCTTAAGAAAACAAGGATT 
GAGATAGACTTACCAATTGATGGTAGAGCATTAATGCGTGATTATTCAACCTTGGCTATTGGCATGGGAGGAGCTTTTTCAACAATTGAAAAAGAC 
TTAAAAAAACATGGTTTTACTAAAGAAGACGTTGATCCTATTACTTGGGCAGTTCATGTTATTTATCAAAATTCAGATAAGGCTGAACTTAAGAAA 

30 TCTATTATGGAAGCCCAAAAACATATGGATGATTATCGTAA 

AGTTTAG C CC CTCTAAATACAGATCCATATGTAACAGAGGAAGATAAAAGAGCGATTTATAATATGGAAAACTTGAG C CAAGAAGAAAGAATTGCT 
CTCTTTAATCGCCAGTGGGAGCCTATGTTGCGTAGAACACCTTTTACACAAATTGCTAATATGACAGGACTCCCAGCTATCAGTATCCCGACTTAC 
TTATCTGAGTCTGGTTTACCCATAGGGACGATGTTAATGGCAGGTC 

CATGGTTTTAATGTTAAATGGCAAAGAATAATAGATAAAGAAGTGAAACCATCTACTGGCCTAATACAGCCTACTAACTCCCTCTTTAA 
35 TCATCATTAGTAAATTTAGAAGAAAATTCACAAGTTACTCAAGTATCTATCT CTAAAAAATGGATGAAATCGTCTGTTAAAAATAAAC CATC CGTA 
ATGGCATATCAAAAAGCACTTCCTAAAACAGGTGATACAGAAT CAAGCCTATCTC CAGTTTTAGTAGTAACC CTTTTATTAGCTTGTTTTAG CTTT 
GTAACAAAAAAGAATCAGAAAAGT 

SEQ ID NO, 10 

40 MKRKYFILNTVTVLTLAAAMOTS 

PVVESTSTKLTEETYKQKDGQDIANIW EEARKLKDTNQPFLGVPLLVKGLGHS I KGGET 

NNGLIYADGKISTFDSSYVKKYKDLGFI ILGQTNFPEYGWRNITDSKLYGLTHNPWDLAHNAGGSSGGSAAAIASGMTPIASGSDAGGSIRIPSSW 
TGLVGLKPTRGLVSNEKPDSYSTAVHFPLTKSSRDAE 

EIDLP IDGRALMRDYSTLAI GMGGAFST I EKDLKKHGFTKEDVDP ITWAVHVI YQNSDKAELKKS IMEAQKHMDDYRKAMEKLHKQFP I FLS PTTA 
45 SLAPLNTDPYVTEEDKRAI YNMENLSQEERIALFNRQWEPMLRRTPFTQIANMTGLPAI S I PTYLSESGLPIGTMLMAGANYDMVLI KFATFFEKH 
HGFNVKWQRI IDKEVKPSTGLI QPTNSL.FKAHSSLVNLEENSQVTQVS I SKKWMKSSVKNKPSVMAYQKAIiPKTGDTESSLSPVLVVTLLIACFSF 
VTKKNQKS 

The nucleotide and amino acid sequences of GBS 276 in Ref. 3 are SEQ ID 8941 and SEQ 
50 ID 8942. These sequences are set forth below as SEQ ID NOS 1 1 and 12: 
SEQ ID NO. 11 

TTGCGTAAAAAACAAAAACTACCATTTGATAAACTT 
ACTGTGACAGAAGACACTCCTGCTACCGAACAAGCCGTAGAACCCCC^ 
ACCTCACAAACTCCTAGTGATGTAGGAGAAAGAGTAGC^GAT 
55 ACCTCAAAAGCGACTATTAGGGATTTGAACGACCCTT^ 

ATTGATGCTGGTTTTGATAAAAATCATGAAGCGTGGCGCTT^ 

AAAGAGCACGGTATTACCTATGGCGAGTGGGTCAATGATAAGGTTGCTTATTACCACGACTATAGTAAAGATGGTAAAAACGCTGTTGATCAAGAA 
CACGGCACACACGTGTCAGGGATCTTGTCAGGAAATC 

CTTTTGATGCGTGTCGAAATTGTAAATGGACTAGCAGACTATGCTCGTAACTACGCrCAAGCTATCAGAGATGC 
60 ATTAATATGAGCTTTGGTAATGCTGCACTAGCTTACGCC^ 

ATTGTGACCTCAGCTGGTAATGATAGTAGCTTTGGGGGCAAGCC^ 
GCGGCAGATTCAACATTGACAGTTGCTTCTTACAGCCC^ 

ATGCCTGTTATTTCAACAAACCGTTTTGAGCCAAACAAGGCTTACGACTATGCTTATGCTAATCGTGGT 
GAAGGTAAGATTGCCCTTATTGAACGTGGCGATATTGATT^ 
65 GAC^TCAAGACAAGGGCTTCCCGATTGAATTGCCAA^ 

AATCCCCC^AAAACCATTACCTTCAATGCGACACCTAAGGTA^ 
GCTGACGGCAATATTAAACCGGATATTGCAGCACCCGGCCAAGA^ 
ATGTCTGCACCATTGGTAGCGGGTATCATGGGACTGTTGCAAAA 
GCTAAGAAAGTATTGATGAGCTCAGCAACTGCCCTATAT^ 
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GCTAAAAAAGCTTCAGCAGCAACGATGTATGTAAC^^ 
ACAGTAAC^GTTCACAAGAAATCTGATAAACCTC^ 
GCTCCTAAAGCATTGTATGAGACATCATGGCAAA^ 
AGCAAGGACTTGCTTGCCCAAATGAAAAATGGCTATTTCT 
5 CCATATATTGGTTTCCGAGGTGATTTTGGCAATCTGTCAGCCTTAGAAAAACCAATCTATGATAGCAAAGACGGTAGCAGCTA 
AATAGTGATGCCAAAGACCAATTAGATGGTGATGGATTACAGT^ 
ACGATTATTAAAGCTGTCAAAGAAGGGGTTGAAAACATAGAGGATA 
CAAGACGATGATAGCCACTACTATATCCACCGTCACGCT^ 
CAATTCCAAGGTACTTTCTTGCGTAATGCTAAAAAC^ 
10 CAAGTTGTTAAAAACTACAACAATGACTTGGCAAGCACACTTGGTTCAACCCGTTTTGAAAA 
GTTGTTGCTAACGGAACCTACACCTATCGTGT 
AATACGACACCTGAAGTCGCAACATCGGCAAGATTCT 

CGTGAGCGTATTGCTTACACTTATATGGATGAGGATCTGCCAACAACAGAGTATATTTCTCCAAATGAAGATGGTACCT 
GCTGAAAGAATGGAAGGCGCTACTGTTCCATTGA 
15 ACTAAGCTATTGGAGGGCCACTCTAATAAGCC^ 

TCAGGTCAAACACCAGATAAAAAAAAAGAAACTAAACCAGAAAAAGATAGTTC^ 

CGTACTCTAGAGAAACGATCTTCTAAGCGTGCTTTAGCTAC^ 

TTACATCTCCTTAAGTTAGTTATGACCACTTTCTTCTTGGGA 

20 SEQ ID NO. 12 

mrkkqklpfdklaialistsillnaqsdikantv^ 
apaktadtpatskatirdlndpshvktm 

dkvay yhd ys kdgknavdqehgthvs gi l s gnaps emke p yrlegam peaqlllmrve i vnglad yarnyaqai rdavnl gakvin 
ms fgnaalayanlpdetkkafdyaks kgvs i vtsagnds s fggrprlpladhpdygvvgtpaaads tltvas ys pdkqltetatvk 
25 tddhqdkempvi s tnrfepnkaydyayanrgtkeddfkdvegki al i ergdidfkdki anakkagavgvl i ydnqdkgfp ielpnv 
dqmpaafisrrdglllkdnppktitfnatpkvlptasgt^ 
lvagimgllqkqyetqypdmtpserld^kkv^ 
sdkfevtvtvhnksdkpqelyyqvtvqto 

fwfkqdptkeelmsipyigfrgdfgnlsalekpiydskdgssyyheansdakdqldgdglqfyalk™ 
30 kegven i ed iesseiteti fagtfakqddd shy y i hrhangkp yaai s pngdgnrd yvqf qgtflrnaknlvaeviidkegnvvwt s 
evteqwknyx^lastlgstrfektrwtc 
ltlaskpktsqpvyreriaytymdedlptteyispnedgtftlp 

nkpeqdgsdqapdkkpeakpeqdgsgqtpt)kkketkpekdssgqtpgktpqkgqssrtlekrsskraia 
nrlhll.klvmttfflg 

35 

The nucleotide and amino acid sequences of GBS 305 in Ref. 3 are SEQ ID 207 and SEQ ID 
208. These sequences are set forth below as SEQ ID NOS 13 and 14: 
SEQ ID NO. 13 

ATGGGACGAGTAATGAAAACAATAACAACATTTC 
40 ACGTTTGTTAGCTAAGTTAGGAGCAATAGTGACAGTTAATGA 
AAGAGGGTATTAAAGTGGTTTGTGGTAGTCATCCTTTAGAAT^ 
CCTTATAACAATCCTATGGTCAAAAAAGCATTAG 
ATCTCAGCTAATAGGTATTAGAGGCTCT^ 

GAGGTTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTGAAGTTGTTCAGGCTGCGAATGATAAAGATACTCTAGTTATGGAATTA 
45 TCAAGTTTTCAGCTAATGGGAGTTAAGGAATTTCGT 

TGGGTCTTTTGAAGATTATGTTGCTGCAAAATGGAATATCCAAAATCAAATGTCTTCATCTGAT 
AAGGTATTTCTAAAGAGTTAGCTAAAACTACT^ 

CAAGACAAGCAACTTTTCTATAAAGGGGAGAATATTATGTCAGTAGATGACATTGGTGTCCCAGGAAGC 
TCTAGGAACTATTGCGGTTGCTAAACTGGCTGGT^ 
50 ACCGCTTGCAATCACTCGGTAAGGTTCATGGTATTAGTTTCTATAACGACAGCAAGTCAACTAATATATTG 

TTATCTGGCTTTGATAATACTAAAGTTATCCTAATTGCAGGAGGTCTTGATCGCGGTAATGAGTTTGATGAATTGATACCAGATAT 

CMTGGACTTAAACATATGGTTGTTTTAGGG^ 

ATGCTTTAGATGTTAGAGATGCGGTACATAAAGCTT^ 

T CATGGGACATGTATAAGAAT TT CGAAGTC CGTGGT GATGAATTCATTGATACTTTCGAAAGTCT TAGAGGAGAG 



55 



SEQ ID NO- 14 



MGRVMKTI TTFENKKVLVLGIiARSGEAAARLLAra 

PYmPMVKKALEKQIPVLTEVEI^YLVSESQLIGITGSNGKTTTTTMIAEVLNAGGQRGLLAGNIGFPAS 
SSFQLMGVKEFRPHIAVITNLMPTHLDYHGSFEDYVAAK^ 
60 QDKQI1FYKGENIMSVDDIGVPGSHNVENAI1ATIAVA 

LSGFDNTKVILIAGGLDRGNEFDELIPDITGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGDV 
SWDMYKNFEVRGDEFI DTFESLRGE 

The nucleotide and amino acid sequences of GBS 313 are in Ref 3 are SEQ ID 4089 and 
65 SEQ ID 4090. These sequences are set forth as SEQ ID NOS 1 5 and 1 6 below: 
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SEQ ID NO. 15 

ATGAAACGTATTGCTGTTTTA&CTAGTGGTGGTGACGC 
GAAGTTTACGGCATCAACCAAGGTTACTATGGTATGGTGACAGGGGAT^^ 

GGAACGTTTTTACGTTCAGCACGTTATCCTGAATTTGCTGAACTTGAAGGTCAGCTTAAAGGGATTGAACAGCTTAAAAAACACGGTATT 

GTAGTAGTTATCGGTGGTGATGGTTCTTATCATGGTGCTATGCGTCTAACTGAGCACGGTTTCCCAGCTGTTGGTTTGCCGGGTACAATTGATAAC 

GATATCGTTGGCACTGACTATACTATTGGTTTTGACACAGCAGTTG 

AACCGTACTTTTGTTGTTGAGGTTATGGGAAGAAATGCAGGAGATATCGCTCTTTGGTCAGGTATCGCTGCAGGTGCAGATCAAATTATTGTTCCT 
GAAGAAGAGTTCAATATTGATGAAGTTGTCTCAAATGTTAGAGCTGGCT^ 

ATGAGTGGTGATGAGTTTGCAAAAACAATGAAAGCAGCAGGAGACGATAGCGATCTTCGTGTGACGAATTTAGGACATCTGCTCCGTGGTGGTAGT 
CCGACGGCTCGTGATCGTGTCTTAGCATCTCGTATGGGAGCGTACGCTGTTCAATTGTTGAAAGAAGGTCGTGGTGGTTTAGCCGTTGGTGTCCAC 
AACGAAGAAATGGTTGAAAGTCCAATTTTAGGTTTAGCAGAAGAAGGTGCTTTGTTCAGCTTGACTGATGAAGGAAAAATCGTTGTTAATAATCCG 
CATAAAGCGGACCTTCGCTTGGCAGCACTTAATCGTGACCTTGCCAACCAAAGTAGTAAA 

SEQ ID NO. 16 

MKRIAVLTSGGDAPGMNAAI RAWRKAI SEGMEVYGINQGYYGMVTGD I FPLDANSVGDT INRGGTFLRSARYPEFAELEGQLKG IEQLKKHGI EG 
WV I GGDGS YHGAMRLTEHGFPAVGLPGT I DND I VGTD YT I G FDTAVATAVENLDRLiRDT S ASHNRTF WE VMGRNAGD I AIi WSG I AAGADQ 1 1 VP 
EEEFN I DE WSNVRAG YAAGKHHQ 1 1 VLAEGVMSGDE FAICTMKAAGDD SDLRVTNLGHLLRGGS PTARDRVIASRMGAYAVQLLKEGRGGIiAVGVH 
NEEMVESP I LGLAEEGALFSLTDEGKI WNNPHKADLRLAALNRDIANQSSK 

The nucleotide and amino acid sequences of GBS 322 in Ref. 3 are SEQ ID 8539 and SEQ 
ID 8540. These sequences are set forth below as SEQ ID NOS 17 and 18: 
SEQ ID NO. 17 

ATGAATAAAAAGGTACTATTGACATCGACAATGGCAGCTTCGCTAT 

CGTACTGTTTCAGAGGTAAAGGCTGATTTGGTAAAGCAAGACAATAAAT 

GCAATGTCAATTGATATGAATGTCTTAGCAAAAATAAATAACAT 

CAGAAGAGTC^TACTGCCACTTCAATGAAAATAGAAACACCAGCAAC 

GTTTCTGTTGCAGACCAAAAAGTTTCTCTCAATACAATTTCGGAAGGTAT 

TCTTCTGCGCCAGCTTTGAAATCAAAAGAAGTATTAGCACAAGAGCAAGCTGTTAGTGAAGCAGCAGCTAATGAACA 
AAGTCGATTACTTCAGAAGTTCCAGCAGCTAAAGAGGAAGTTAAAC 

GCCGCTGAAACAC CAGCTCCAGTAGCTAAAGTAGCACCGGTAAGAACTGTAGCAG CCC CTAGAGTGGCAAGTGTTAAAGTAGTCACTCCTAAAGTA 
GAAACTGGTGCATCACCAGAGCATGTATCAGCTCCAGCAGT^ 

AAGAGCGTTCCGGTAGCACAAAAAG CTCCAACAGCAACACCGGTAGCACAACCAG CTT CAACAACAAATGCAGTAGCTGCACATC CTGAAAATGCA 
GGGCTCCAACCTCATGTTGCAGCTTATAAAGAAAAAGTAGCGTCA^ 

CATGGTAAAGGTTTAGCAGTTGACTTTATTGTAGGTACTAATCAAGCACrTGGTAATAAAGTTGCACAGTAC 
AACATTTCATATGTTATCTGGCAACAAAAGTTTTACTCA 

GGCGTTACTGCCAACCACTATGACCACGTTCACGTATCATTTAACAAATAATATAAAAAAGGAAGCTATTTGGCTTCTTTTTTATATGCCTTGAAT 
AGACTTTCAAGGTTCTTATATAATTTTTATTA 

SEQ ID NO. 18 

MNKKVLLTSTMAASLLSVASVQAQETDTTWTARWSEVKADLVKQDNKSSYTVKYGDTIjSVISFAMS 

QKSHTATSMKI ET PATNAAGQTTAT VDLKTNQVS VADQKVSLNT I SEGMT PE AATT I VS PMKT YS SAPALKS KE VLAQEQAVS QAAANEQVS PAPV 

KSITSEVPAAKEEVKPTQTSVSQSTTVSPASVAAETPAPVAKVAPWTVAAPRVASVKVVTPKVETGASPEHVSAPAVPVT 

KSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQ 

NISWIWQQKFYSNTNSIYGPANTWNAMPDRGGVTANHYDHVHVSFNK 

The nucleotide and amino acid sequences of GBS 328 in Ref. 3 are SEQ ID 6015 and SEQ 
ID 6016. These sequences are set forth below as SEQ ID NOS 19 and 20: 
SEQ ID NO. 19 

ATGAAAAAGAAAATTATTTTGAAAAGTAGTGTTCTTGGTTTAGTCGCTGGGACTTCTATTATGTTCTCAAGCGTGTTCGCGGACCAAGTCGGTGTC 
CAAGTTATAGGCGTCAATGACTTTCATGGTGCACTTGACAATACTGGAACAGCAAA.TATGCCTGATGGAAAAGTTGCTAATGCTGGTACTGCTGCT 
CAATTAGATGCTTATATGGATGACGCTCAAAAAGATTTCA^ 

GCAAGTCCAGCCAACTCTGGGCTTCTTCAAGATGAACCAACTGTCAAAAATTTTAATGCAATGAATGTTGAGTATGGCACATTGGGTAACCATGAA 

TTTGATGAAGGGTTGGCAGAATATAATCGTATCGTTACTGGT^ 

GAAGCTGCAAAACAAGAAATTGTAGTGGCAAATGTTATTGATAAAG^ 

CCTGTAAATAACAAAAGTGTGAACGTTGGCjITTATCGGGATT 

TTTTTAGATGAAGCTGAAACAATCGTTAAATACGCCAAAGAATTACAAGCTAAAAATGTCAAAGCTATTGTAGT 

AGTAAAAATGATATTGCTGAAGGTGAAGCAGCAGAAATGATGAAAAAAGTCAATCAACTCTTCCCTGAAAATAGCGTAGATATTGTCTTTGCTGGA 

CACAATCATCAATATACAAATGGTCTTGTTGGTAAAACTCGTATTGTACAAG CGCTCT CT CAAGGAAAAGCCTATGCTGATGTACGTGGTGTCTTA 

GATACTGATACACAAGATTTCATTGAGACCCCTTCAGCTAAAGTAATTGCAGTTGCTCCTGGTAAAAAAACAGGTAGTGCCGA 

GTTGACCAAGCTAATACTATCGTTAAACAAGTAACAGAAGCTAAAATTGGTACTGCCGAGGTAAGTGTCATGATTACGCGTTCTGTTGATCAAGAT 

AATGTTAGTC CGGTAGG CAG CCTCATCACAGAGGCTCAACTAG CAATTGCTCG AAAAAGCTGGCCAGATATCGATTTTGCCATGACAAATAATGGT 

GGCATTCGTGCTGACTTACTCATCAAACCAGATGGAACAATCACCTGGGGAGCT^ 

GAAATTACTGGTAGAGATCTTTATAAAGCACTCAACGAACAATACGACCAAAAA 

ACAGATAATAAAGAGGGCGGGGAAGAAACACCATTTAAAGTTGTAAAAGCTTATAAATCAAATGGTGAGGAAATCAATCCTGATG 
TTAGTTATCAATGACTTTTTATTCGGTGGTGGTGATGGC 

TTTATGGCCTATATCACTGATTTAGAAAAAGCTGGTAAAAAAGTGAGCGTTCCAAATAATAAACCTAAAATCTATGTCACTATGAAGA 
GAAACTATTACACAAAATGATGGTACACATAGCATTATTAA 



WO 2004/041157 



PCT/US2003/029167 



ACTTTAAACCAAACAAAATCAAAATCTACAAAAATCAACCCT 
AGAAATTATGGCAAACCATCAAACTCGACTACTGTAAAATC^ 
TTTGGTGTTGGACTTATAGGAATTGCTTTAAATACAAAGAAAAAACATATGAAA 

5 SEQ ID NO. 20 

MKKKI I LKSS VLGLVAGTS IMPS S VFADQVGVQVI GVNDFHGALDNTGTANMPDGKVANAGTAAQLDAYMDDAQKDFKQTNPNGES I RVQAGDMVG 
ASPANSGLLQDEPTVKNFNAMNVEYGTLGNHEFDEGLAEYm 

PVNNKS VNVGFI G IVTKD I PNLVLRKNYEQYEFLDEAET I VKYAKELQAKNVKAI WLAHVPATSKND I AEGEAAEMMKKVNQLFPENSVD I VFAG 
HNHQ YTNGIjVGKTR I VQALS QGKAYAD VRG VLDTDTQDF I ET P SAKVT AVAPGKKTGS AD I QAI VDQANT I VKQVTEAKI GTAEVS VMITRS VDQD 
1 0 NVS PVGSL I TEAQLA I ARKS WPD I D FAMTNNGG I RADLIiI KPDGT I T WGAAQAVQ PFGN I LQWE I TGRDLYKALNE QYDQKQNF FL Q I AGLRYT Y 
TDNKEGGEETPFKVVKAYKSNGEEINPDAI<YKIjVINDFLFGGGDGFASFRNAKIiLGAINPDTEVFMAY 
ET I TQNDGTHSI I KKLYLDRQGNIVAQEIVSDTLNQTKSKSTK^ 
FGVGL, IGI ALNTKKKHMK 

15 The nucleotide and amino acid sequences of GBS 330 in Ref. 3 are SEQ ID 8791 and SEQ 

ID 8792. These sequences are set forth below as SEQ ID NOS 21 and 22: 
SEQ ID NO. 21 

ATGAATAAACGCGTAAAAATCGTTGCAACACTTGGTCCTGCGGTTGAATTCCGTGGTGGTAAGAAGTTTGGTGAGTCTGGATACTGGGGTGAAAGC 
CTTGACGTAGAAGCTTCAGCAGAAAAAATTGCTC 
20 CAAGGAGCTCGTATGGCTACTGTTCGTAAAGCAGAAGAGAT^ 
GAACTTTTTGAAGATGGTGCAGATTTCCATTCA 

ATTGCATTGAATGTTGCTGGTGGACTTGACATCTTTGATGACGTTGAAGTTGGTAAGCAAATCCTTGTTGATGATGGTAAACTAGGTCTTACTGTG 

TTTGCAAAAGATAAAGACACTCGTGAATTTGAAGTAGTTGTTGAGAATGATGGCCTTATTGGTAAACAAAAAGGTGTAAACATCCCTTATACTAAA 

ATTCCTTTCC CAGCACTTGC AGAACGCGATAATGCTGATATC CGTTTTGGACTTGAGCAAGGACTTAACTTTATTGCTATCT CATTTGTACGTACT 
25 GCTAAAGATGTTAATGAAGTTCGTGCTATTTGTGAAGAAACT 

AATATTGATGAGATTATCGAAGCAGCAGATGGTATTATGATTGCTCGTGGTGATATGGGTATCGAAGTTCCATTTGAAATGGTTCCAGTTTACCAA 

AAAATGATCATTACTAAAGTTAATGCAGCTGGTAAAGC^ 

TCAGAAGTATCTGATGTCTTCAATGCTGTTATTC 

GTTCGTACAATGGCTACTATTGATAAAAATGCTCAAAC!^^ 
30 GATGTTATTGCATCTGCGGTTAAAGATGCAACACA^ 

TCTAAATTCCGTCCAGATGCAGACATTTTGGCTGTTACAT 

GCAGACAAACCAGCATCTACAGATGATATGTTTGAGGTTGCAGAAlCGTGTAGCACTTGA 
GTTGCAGGTGTTCCTGTAGGTACAGGTGGAACTAACACAATGCGTGTTCGTACTGTTAAA 

35 SEQ ID NO. 22 

MNKRVKI VATLGPAVEFRGGKKFGESGYWGESLDVEAS AEKI AQL I KEGANVFRFNFSHGDHAEQGARMATVRKAEE I AGQKVGFLLDTKGPE I RT 
ELFEDGAD FHSYTTGTKLRVATKQGI KSTPEVI ALNVAGGLD I FDDVEVGKQI IjVDDGKLGLTVFAKDKDTREFEVWENDGLIGKQKGVNI PYTK 
I P F PAIiAE RDNAD I RFGLEQGLNFI AI S FVRTAKDVNE VRAI CEETGXGHVKLFAKI ENQQG I DNIDE 1 1 EAADG I M I ARGDMGI EVPFEMVPVYQ 
KM 1 I TKVNAAGKAV I TATNMLETMTDKPRATRS E VSD VFNAV I DGTDATMLS GESANGKYPVE S VRTMAT I DKNAQTLLNEYGRLDS SAFPRNNKT 
40 D VI AS AVKDATHSMD I KLWT I TETGNTARA I SKFRPDAD I LAVTFDEKVQRSLM I NWGVI PVIiADKPASTDDMFEVAERVALEAGF VESGDN I VI 
VAGVPVGTGGTNTMRVRTVK 

The nucleotide and amino acid sequences of GBS 338 in Ref. 3 are SEQ ID 8637 and SEQ 
ID 8638. These sequences are set forth below as SEQ ID NOS 23 and 24: 
45 SEQ ID NO. 23 

TTGTCTGC TATAATAGACAAAAAGGTGGTGATATTTATGTATTTAGCAT TAAT CGGTGATATCATTAATTCAAAACAGATACT TGA 
ACGTGAAACTTTCCAACAGTCTTTTCAGCA 
TTACAGCTGGTGATGAATTTCAAGCTTTATTGAAAC 
♦ CCTGTTAATGTAAGGTTCGGCCTCGGTACAGGAA^ 
50 CTACTGGCATGCTCGCTCAGCTATTAAT 

AAGACCAAAACCTTGAATTAACACTAAATAGTCTCATT^ 

ATGCTTGAGCACT TAATACT TCAAGATAATTATCAAGAACAAT TTCAACAT CAAAAGTTAGCC CAACTGGAAAATATTGAACCTAG 

TGCGCTGACTAAACGCCTTAAAGCAAGCG^TCTGAAGACT 

GCACTCAAACTAAAGGGGGAAGCTATGATTTC 



55 



60 



SEQ ID NO. 24 

MSA I IDKICWIFMYLALIGDIINSKQILERETFQQSFQQLMTELSDVYGEELISPFTITAGDEFQALLKPSKKVFQI IDHIQLALKPVNVRFGIjGTG 
NI I TS INSNES I GADGPAYWHARSAINH I HDKNDYGT VQVAI CLDDEDQNLELTLNSL I SAGDFI KSKWTTNHFQMLEHL ILQDNYQEQFQHQKLAQ 
LENIEPSALTKRLKASGLKIYIiRTRTQAADLLVKSCTQTKGGSYDF 

The nucleotide and amino acid sequences of GBS 358 in Ref. 3 are SEQ ID 3183 and SEQ 
ID 3184. These sequences are set forth below as SEQ ID NOS 25 and 26: 
SEQ ID NO. 25 
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ATGTTTTATACAATTGAAGAGCTGGT 

GGTAGAAGTCGTGAAGAAATTCGTTATATTATGTCCCGAAATCTTGAAGT 
AGTGGTTTAACAGGCGGTGATGCTGTCAAGATGGATCAATATTTACAATC^ 

GCTATGGCTGTTAATGAGTTAAATGCTAAGATGGGACTGGTCTGTGCAACACCAACTGCAGGTAGTGCAGGATGTTTACCAGCTGTGATTTCTAC1A 
5 GCCATTGAAAAGCTTAATTTAACAGAAGAAGAGCAACTTGATTTTCTATTTACAGCCGGCGCATTTGGTCTCGTCATTGGTAATAATGCCTCTATC 
TCAGGTGCAGAAGGAGGTTGCCAAGCTGAAGTTGGGTCAGCTAGTGCTATGGCTGCGGCTGCTTTAGTTATGGCTGCTGGAGGTACTCCTTTCCAA 
GCTAGCCAAGCTATAGCATTTGTTATTAAAAATATGCTTGGACTTATCTGTGACCCTGTTGCAGGTTTAGTTGAAGTCCCTTGTGTGAAGCGGAAT 
GCTCTTGGATCAAGTTTTGCACTTGTTGCTGCTGATATGGCCTTGGCTGGTATTGAATCGCAAATTCCAGTAGATGAAGTTATTGATGCAATGTAT 
CAAGTTGGATCAAGTTTACCGACTGCTTTTCGTGAGACTGCAGAAGGAGGACT 
1 0 GGGGAA 

SEQ ID NO. 26 

MFYTIEELVEQANSQHKGNIAELMIQTEIEMTGRSREEIRYIM 
AMAWELNAKMGLVCATPTAGSAGCLPAVISTAIEKLNL^^ 
1 5 ASQAI AFVI KNMLGL I CD P VAGLVE VPC VKRNALG S S FALVAADMALAG I E S Q I PVDEVI DAM YQ VGS SLPTAFRETAEGGLAATPTGRRYSKE I F 
GE 

The nucleotide and amino acid sequences of GBS 361 in Ref. 3 are SEQ ID 8769 and SEQ 
ID 8770. These sequences are set forth below as SEQ ID NOS 27 and 28: 
20 SEQ ID NO. 27 

ATGAGCGTATATGTTAGTGGAATAGGAATTATTTCITCTTTGGGAAAGAATTATAGCGAGCATAAACAGCATCTCTTCGACTTAAAAGAAGGAATTT 
CTAAACATTTATATAAAAATCACGACTCTATTTTA 

ACGTAATTTTAAATTTGCTTTTACCGCTTTTGAAGAGGCTCTTGCTTCTTCAGGTGTTAATTTAAAAGCTTATCATAATATTGCTGTGTGTTTAGGG 

ACCTCACTTGGGGGAAAGAGTGCTGGTCAAAATGCCTTGTATCAATTTGAAGAAGGAGAGCGTCAAGTAGATGCTAGTTTATTAGAAAAAGCATCTG 
25 TTTACCATATTGCTGATGAATTGATGGCTTATCATGATATTGTGGGAGCTTCGTATGTTATTTCAACCGCCTGTTCTGCAAGTAATAATGCCGTAAT 

ATTAGGAACACAATTACTTCAAGATGGCGATTGTGATTTAGCTATTTGTGGTGGCTGTGATGAGTTAAGTGATATTTCTTTAGCAGGCTTCACATCA 

CTAGGAGCTATTAATACAGAAATGGCATGTCAGCCCTATTCTTCTGGAAAAGGAATCAATTTGGGTGAGGGCGCrGGTTTTGTTGTTCTTGTCAAAG 

ATCAGTCCTTAGCTAAATATGGAAAAATTATCGGTGGTCTT^^ 

ACAGATTGCAAAGCAGCTAGTGACTCAAGCAGGTATTGACTACAGTGAGATT^ 
30 ATGGAAAAAAATATGTATGGTAAGTTTTTCCCGACAACGACA^ 

TCGAATTGATTAATTGTTTAGCGGCAATAGAGGAAC^GACTGTACCAG 

TCATCIAAAAGAGAGAATACCCAATAAGAAATGCTTTAAATTT^ 

CCTCTAGAAACAT.TACCTGCTAGAGAAAAT CTTAAAATGGCTATCTTATCATCTGTTG CTTC CATTTCTAAGAATGAATCACTTTCTATAACCTATG 
AAAAAGTTGCTAGTAATTTC^CGACTTTGAAGCATTAC^ 
35 TGATTTTTCC^AAATGGTTGCCGTAACAACAGCTCAAGCACTAATAGAAAGCAATATTAATCTAAAAAAACAAGATACTTC 
TTTACAACACTTTCTGGACCAGTTGAGGTTGTTGAAGGTATTO 

TTACAGTAATGAATG CAGCAGCTGGTATGCTTTCTATCATTTTTAAAATAACAGGTC CTTTATCTGTCATTT CGACAAATAGTGGAGCGCTTGATGG 
TATACAATATGCCAAGGAAATGATGCGTAACGATAATCTAGACTATGTGATTCTTGTTTCTGCTAATCAGTGGACAGACATGAGTTTTATGTGGTGG 
CAACAATTAAACTATGATAGTCAAATGTTTGTCGGTTCTGAT^ 
40 TAGGTAGTAAACAATTAAAATATAGCCATAAAACAT 

ATGCCAAACCTTGCTTCTGGTCAGTTTGGATTTTCATCTAATGGTGCTGGTGAAGAACTGGACTATACTGTTAATGAAAGTATAGAAAAGGGCTATT 
ATTTAGTCCTATCTTATTCGATCTTCGGTGGTATCTCTTTTGCTATTATTGAAAAAAGG 



45 SEQ ID NO. 28 

MSVYVSG I GI I S SLGKNYSEHKQHLFDLKEG I SKHLYKNHDS I LES YTGS I TSDPEVPEQYKDETRNFKFAFTAFEEAIASSGVNLKAYHNIAVCLG 
TSLGGKSAGQNALYQFEEGERQVDASLLEKASVYHIADELMAYHDIVGASYVISTACSASNNAVILGTQLLQDGDCDLAICGGCT 
LGAINTEMACQP YSSGKG INLGEGAGFVVLVKDQSIiAKYGKI IGGLI TSDGYH I TAPKPTGEGAAQI AKQLVTQAGI DYSEI D YI NGHGTGTQANDK 
MEKNMYGKFFPTTTIjISSTKGQTGHTLGAAGI IELINCLAAIEEQTVPATKNEIGIEGFPENFVYHQKREYPIRNALNFSFAFGGNNSGVLiLSSLDS 
5 0 PLETLPARENLKMAIIiSSVAS I SKNESLS I TYEKVASNFNDFEALRFKGARPPKTVNPAQFRKMDDFSKMVAVTTAQALIESNINLKKQDTSKVGIV 
FTTLSGPVE WEG I E KQ I TTEG YAHVS ASRFP FTVMNAAAGMLS 1 1 FKITGPLS VI STNSGALDG I QYAKEMMRNDNLDYVI LVSANQWTDMSFMWW 
QQLNYDSQMFVGSDYCSAQVLSRQALDNSP 1 1 LGSKQLKYSHKTFTDVMT I FDAALQNIiLSDLGIiTI KD I KGFVWNERKKAVSSDYDFLANLSEYYN 
MPNIASGQFGFSSNGAGEELDYTVNES I EKGYYLVLS YS I FGGI S FAI I EKR 

55 The nucleotide and amino acid sequences of GBS 404 in Ref. 3 are SEQ ID 8799 and SEQ 

ID 8800. These sequences are set forth below as SEQ ID NOS 29 and 30: 
SEQ ID NO. 29 

ATGAAAATAGATGACCTAAGAAAAAGCGACAATGTTGAAGATCGTCGCTCCAGTAGCGGAGGTTCATTCTCTAGCGGAGGAAGTGGATTACCGATT 
CTTCAACTTTTATTGCTGCGAGGGAGTTGGAAAACCAAGCTTGTGGTTTTAATCATCTTACTGCTACTTGGCGGAGGGGGACTAACCAGCATTTTT 
60 AATGACTCATCCTCACCTTCTAGTTACCAATCTCAGAATGTCT 

GTCCTTGGCTCAACTGAGGATTTCTGGTCACAAGAATTCCAAACC 

ATTCAAACAGGTTGTGGTATAGGTGAATCTGCTTCAGGACCA 

TTATCACATAAATATGGTGCTACTGGTGATTTTGCTAT 

GATAAGTATAATAGAATGCGACACGGACTTACTAAGAAAGAAGCAAATGCT^ 
65 TGGGCTCACTACATCAGGGGAAAAAATCTCTTAGAACAAGGA 

AAAGAAACCTACGGAAAATTAGTGCCTGATAGCTTTACCCATGGAACAG 
- ATCCAACACGGTGATACTTTCTCCGTAGAACATCTA 
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SEQ ID NO. 30 

MKI DDLRKSDNVEDRRS S SGGS FSSGGSGLPI LQLLLLRGSWKTKLVVL I ILLLLGGGGLTS I FNDSSSPSSYQSQNVSRSVDNSATREQIDFVNK 
VLG STEDF WS QE FQTQGFGNYKE PKLVLYTNS I QTGCG I GES ASGPFY CS ADKKI YLD I S FYNELSHKYGATGDFAMAYVI AHEVGHHI QTELGI M 
DKYNRMRHGLTKKEANALjNVRLELQADYYAGV^ 
5 IQHGDTFSVEHL 

The nucleotide and amino acid sequences of GBS 656 in Ref. 3 are SEQ ID 9323 and SEQ 
ID 9324. These sequences are set forth below as SEQ ID NOS 31 and 32: 
SEQ ID NO. 31 

10 ATGAAAAGATTACATAAACTGTTTATAACCGTAATTG CTACATTAGGTATGTTGGGGGTAATGAC CTTTGGTCTTCCAACGCAGCCG CAAAACGTA 

ACGCCGATAGTACATGCTGATGTCAATTCATCTGTTGATACGAGCCAGGAATTTCAAAATAATTTAAAAAATGCTATTGGTAACCTACCATTTCAA 

TATGTTAATGGTATTTATGAATTAAATAATAATCAG 

CAAAGACTATCAACTGCTAATGCAATGCTTGATAGAACCA^ 

TTAGGTTGGCATCAAGTAGCTACTAATGACCATTATGGACAT 
15 TGGGATG CTTCCGTGTCAAATCCTCAAAATGTTGTCACACAAACAGCT CATTCCAACCAATCAAATCAAAAAATCAAT CGTGGACAAAATTATTAT 

GAAAGCTTAGTTCGTAAGGCGGTTGACCAAAACAAACGTGTTCGTTACCGTGTAACTCCATTGTACCGTAATGATACTGATTTAGTTCCATTTG 

ATGCACCTAGAAGCTAAATCACAAGATGGCAC^TTAGAATTTA^ 

GAAATAACACTAAAT 

20 SEQ ID NO. 32 

MKRLHKLFITVIATLGMLGVMTFGLPTQPQNWPI 
QRLSTANAMLDRTIRQYQNRRDTTLPDANWKPLGWH^ 

ESLWKAVDQNKRWYRWPLYRNDTDLVPFAMHLEAKSQDGTLEFNVAI PNTQAS YTMDYATGE ITLN 

25 The nucleotide and amino acid sequences of GBS 690 in Ref. 3 are SEQ ID 9965 and SEQ 

ID 9966. These sequences are set forth as SEQ ID NOS 33 and 34 below: 
SEQ ID NO. 33 

ATGAGTAAACGACAAAATTTAGGAATTAGTAAAAAAGGA 
GTACAATCTCAACCTAATAAGAGTGCAGTAAAAACTA 
30 AAAGCTAAGGCTAATCAAGAACAGTATGTGTATTTTGATGCT 
" GGTCAGCAGTTAGTTCAATAT 
CTAAAGACAACAGGAAGTCTTCCAGCTATGGAA^ 

AATCGTCTACAGCAAAATTATCAAAGTCAAGCTAATGCTTCATACAACC^ 

GTAAATAAAGCACAAAAAGCATTGAATGATACTGTTATTACAAGTGACGTATCAGGGACAGTTGTTGAAGTTAATAGTGA 
35 AAAACTAGTCAAGTACTTGTCCATGTAGCAACTGAAGGTAAACTCC 
CAGGCTGTTAAAATAAAATCTAAGGTCTATCCTGAC^^GGAAT 

AATGACTCTAATAACGGCTCTAGTGCTGTAAATTATAAATATAAAGTAGATATTACTAGCCCTCTCGATGCATTAAAACAAGGTTTTACCGTATCA 
GTTGAAGTAGTTAATGGAGATAAGCACCTTATTGTCCCTACAAGTTCTGTGATAAACAAAGATAATAAACACTTTGTTTGGGTATACAATGATTCT 
j^TCGTAAAATTTCCAAAGTTGAAGTCAAAATTGGTAA 
40 ACTAATCCAAGTAAAACCTTCAAGGATGGGCAAAA 

SEQ ID NO. 34 

MS KRQNLG I SKKGAI I SGLS VAL I WI GGFL WVQSQPNKSAVKTNYKVFNVREGS VSSSTLLTGKAKANQEQYVYFDANKGNRATVTVKVGDKI TAG 
QQIiVQYDTTTAQAAYDTANRQLNKVARQ INNLKTTGSLPAMES SDQS SSS SQGQGTQSTSGATNRLQQNYQSQANAS YNQQLQDLNDAYADAQAEVN 
45 KAQKALNDTVITSDVSGTVVEWSDIDPASKTSQVL^ KS KVYPDKE WEGKI S Y I SNYPEAEANNNDS 

NNG S S AVN YKYKVD I TS PLDALKQGFT VS VE VVNGDKHli I VPT S S VI NKDNKH FVWVYND SNRKI S KVE VKI GKADAKTQE I L SGLKAGQ I WTNP S 
KTFKDGQKIDNIES IDLNSNKKSEVK 

The nucleotide and amino acid sequences of GBS 691 in Ref. 3 are SEQ ID 3691 and SEQ 
50 ID 3692. These sequences are set forth as SEQ ID NOS 35 and 36 below: 
SEQ ID NO. 35 

ATGAAAAAAATTGGAATTATTGTCCTCACACTACTGACCT 
TCTAAAATGCCTAAAATTGAAGGCTTCACCTATTATGGAAAAA^ 
TTATTAAAACTAGGTGTTAATGTTTCAAGTTACAGT^ 
55 ACTGCTGATGATACAGAAGCTATTGCCGCACAAAAACCTGAT^ 
CCAACTTTAGTTATTAAATATGGTGCAGAAAAT 
GTTAGCCAATGGAAAACTAAAACTCTCGCTGTCAAA 
AAAAATATCTATTTATATGGTAATAATTTTGGACGCGGTC 

GTCTTTAAAAAAGGGTGGTTTACCGTTTCGCAAGAAGCAATCGGTGATTACGTTGGAGATTATGCCCTTGTTAATATAAACAAA 
60 GCAGCTTCATCACTTAAAGAAAGTGATGTCTGGAAGAATT^ 

TCTGACCCTCTATCTTTAGAAGCTCAATTAAAATCATTTACAAAGGCTATCAAAGAAAATACAAAT 

SEQ ID NO. 36 
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MKKIGIIVLTLLTFFLVSCGQQTKQESTKTTIS 

GKQLKEAKKI1TADDTEAIAA.QKPDLIMVFDQDPNINTI1KKIAPTLVIKYGAQNYLDMMP 
KDLHHILKPNTTFTIMDFTOKNIYLYGNNFGR^ 

SSLKESDWKIJLPAVKKGHIIESlSraVFYFSDPLSLFAQLKSFTKAIKENTN 

Other preferred polypeptide antigens include: GBS4 (SEQ ID 2 from Ref. 3); GBS22 (SEQ 
ID 8584 from Ref. 3); and GBS85 (SEQ ID 216 from Ref. 3), including polypeptides having amino 
acid sequences with sequence identity thereto etc. 

The polypeptide is preferably not a C protein (alpha or beta or epsilon) or a R protein (Rib). 

The nucleotide and amino acid sequences of GBS 4 in Ref. 3 are SEQ ID 1 and SEQ ID 2. 
These sequences are set forth below as SEQ ID NOS 37 and 38: 
SEQ ID NO. 37 

ATGAAAGTGAAAAATAAGATTTTAACGATGGTAGCACTTACT 

ATACTGAC^CGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATC 

GAGTTCATCAAGTGAACCAGAAACAAATCCGTCAACTAATC 

AGAACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAfl 

GTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGG 

AGGAGATAGCCACTCAGATACTGTAATAGCATCTACGGGAGGGATTATTCTGTTATCATTAAGTTTTTACAATAAGAAAATGAAACTTTAT 

SEQ ID NO. 38 

MKVKNKILTNTVAIjTVLTCATYSS 

RTKTE I GNNKD I S SGTKVLI SEDS I KNFSKAS SDQEEVDRDES SSSKANDGKKGHSKPKKELPKTGDSHSDTVIASTGGI ILLSLSFYNKKMKLY 

The nucleotide and amino acid sequences of GBS 22 in Ref. 3 are SEQ 8583 and SEQ ID 
8584. These sequences are set forth below as SEQ ID NOS 39 and 40: 
SEQ ID NO. 39 

ATGAAAAGGATACGGAAAAGCCTTATTTTTGTTCTCGGAGTAGTTAC^ 
TGTCAGTAGTGACTAGCTTTTATCCAGTATATTCC^ 

TCATGGTTTTGAACC CT CATCAAGTGATGTTG CTGCCATTTATGATGCTGAT CTATTTCTTTATCATTCGCACACACTAGAAGCTTGGGCGAGACGT 
TTGGAACCTAGTTTGCATCACTCTAAAGTATCTGTAATTGAAGCTTCAAAAGGTATGACTTTGGATAAAGTTCATGGCTTAGAAGATGTAGAGGCAG 
AAAAAGGAGTAGATGAGTCAACCTTGTATGACCCTCACACTTG 

TAAAAAGGAT C CTAAAAACGCTAAGGTTTATCAA?U\AAATGCTGATCAATTTAGTGACAAGG CAATGGCTATTGCAGAGAAGTATAAG CCAAAATTT 
AAAGCTGCAAAGTOTAAATACTTTGTGACTTCACATACAGCATTCTCATACTTAGCTAAGCGATA 

CAACCGAGCAAGAACCTAGTGCTAAAAAATTAGCCGAAATTCAGGAGTTTGTGAAAACATATAAGGTTAAGACTATTTTTGTTG 

ACCTAAATTAG CTCAAGC AGTAG CTTCAGCTACTCGAGTTAAAATTG CAAGTTTAAGT CCTTTARAAGCAGTT CCCAAAAACAATAAAGATTACTTA 

GAAAATTTGGAAACTAATCTTAAGGTACTTGTCAAATCGTTAAATCAATAG 

SEQ ID NO. 40 

MKRIRKSLI FVLGWTLI CLCACTKQSQQKNGLSWTSFYPVYSITKAVSGDLNDI KMIRSQSGIHGFEPSSSDVAAIYDADLFLYHSHTLEAWARR 
LEPSLHHSKVSVIEASKGMTLDKVHGLEDVEAEKGVDEST^^ 

KAAKS KYF VT SHTAF S YLAKRYGLTQLG I AGVSTEQE PSAKKIAE I QEFVKTYKVKT I FVEEGVS PKIAQAVAS ATRVKI AS L S PLXAVP KNNKDYL 
ENLETNLKVLVKSLNQ 

The nucleotide and amino acid sequences of GBS 85 in Ref. 3 are SEQ ID 215 and SEQ ID 
216. These sequences are set forth below as SEQ ID NOS 41 and 42: 
SEQ ID NO. 41 

ATG CCTAAGAAGAAATCAGATACCCCAGAAAAAGAAGAAGTTGTCTTAACGGAATGG CAAAAG CGTAACCTTGAATTTTTAAAAAAACGCAAAGAAG 

ATGAAGAAGAACAAAAACGTATTAACGAAAAATTACGCTTAGATAAA 

AATTAAGAAGCTTCATTTTCCAAAGATTTCAAGACCTAAGATTC 

ATTAGAACTGCACCTATATTTGTAGTAGCATTCCTAGTCATO 

GTGGAAATCAGCATACACCTGATGATATTTTGATAGAGAAAAC 

TGAACAACGTTTAGCTGCAGAAGATGTATGGGTAAAAACAGCT 

ATTATTG CATATGCACATACAAAGCAAGGATAT CAACCTGTCTTGGAAACTGGAAAAAAGGCTGATC CTGTAAATAGTTCAGAGCTACCAAAGCACT 

TCTTAACAATTAACCTTGATAAGGAAGATAGTATTAAGCT^^ 

AAGTTTAGCTGATTCTAAAACGACACCTGACCTCCTGCTGTTA^ 

CTTCCTTTTTACAAACAAATTAAGAAGAACCTTAAGGAACC^ 

CCCCTGTTAAAGCAGAAGATACAAAAAATAAATCAACTGATAAAACACAAACACAAAA 
CTCAAATACTAATCAACAAGGACAACAGATAGC^ 
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SEQ ID NO. 42 

MPKKKSDTPEKEEVVLTEWQKRNLEFLKKRKEDEEEQK^ 

I RTAP I FWAFLVI LVS VFLLTPFSKQKT I TVSGNQHTPDD I L I EKTNIQKNDYFFSLI FKHKAI EQRLAAEDVWVKTAQMT YQFPNKFH I QVQENK 
5 1 1 AYAHTKQGYQPVLETGKKADPVNSSELPKHFLT INLDKEDS I KLL I KDLKALDPDL I SE I QVI SLADSKTTPDLLLLDMHDGNSIRIPLSKFKER 
LPFYKQI KKNLKEPS I VDME VGVYTTTNT I ESTPVKAEDTKNKSTDKTQTQNGQVAENSQGQTNNSNTNQQGQQIATEQAPNPQNVN 

GBS polypeptides of the invention may be present in the composition as individual separate 
polypeptides. It is preferred, however, that two or more (i.e. 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 

10 15, 16, 17, 18, 19 or 20) of the antigens are expressed as a single polypeptide chain (a 'hybrid 5 
polypeptide). Hybrid polypeptides offer two principal advantages: first, a polypeptide that may be 
unstable or poorly expressed on its own can be assisted by adding a suitable hybrid partner that 
overcomes the problem; second, commercial manufacture is simplified as only one expression and 
purification need be employed in order to produce two polypeptides which are both antigenically 

15 useful. 

The hybrid polypeptide may comprise two or more polypeptide sequences from the first 
antigen group. Accordingly, the invention includes a composition comprising a first amino acid 
sequence and a second amino acid sequence, wherein said first and second amino acid sequences are 
selected from a GBS antigen or a fragment thereof. Preferably, the first and second amino acid 

20 sequences in the hybrid polypeptide comprise different epitopes. 

The hybrid polypeptide may comprise one or more polypeptide sequences from different 
GBS serotypes. Accordingly, the invention includes a composition comprising a first amino acid 
sequence and a second amino acid sequence, said first amino acid sequence and said second amino 
acid sequence selected from a GBS serotype selected from the group consisting of serotypes la, lb, 

25 la/c, II, III, IV, V, VI, VII and VIII. The first and second amino acid sequence may be from the same 
GBS serotype or they may be from different GBS serotypes. Preferably, the first and second amino 
acid sequence are selected a GBS serotype selected from the group consisting of serotypes II and V. 
Most preferably, at least one of the first and second amino acid sequences is from GBS serotype V. 
Preferably, the first and second amino acid sequences in the hybrid polypeptide comprise difference 

30 epitopes. 

In one embodiment, the hybrid polypeptide comprises one or more GBS antigens from 
serotype V. Preferably, the hybrid polypeptide comprises a first amino acid sequence and a second 
amino acid sequence, said first amino acid sequence and said second amino acid sequence 
comprising a GBS antigen or a fragment thereof selected from the group consisting of GBS 80, GBS 
35 91, GBS 104, GBS 147, GBS 173, GBS 276, GBS 305, GBS 313, GBS 322, GBS 328, GBS 330, 
GBS 338, GBS 358, GBS 361, GBS 404, GBS 656, GBS 690, and GBS 691. Preferably, the GBS 
antigen or fragment thereof is selected from the group consisting of GBS 80 and GBS 691. 
Preferably, the first and second amino acid sequences in the hybrid polypeptide comprise difference 
epitopes. 



14 



WO 2004/041157 



PCT/US2003/029167 



Hybrids consisting of amino acid sequences from two, three, four, five, six, seven, eight, 
nine, or ten GBS antigens are preferred. In particular, hybrids consisting of amino acid sequences 
from two, three, four, or five GBS antigens are preferred. 

Different hybrid polypeptides may be mixed together in a single formulation. Within such 
5 combinations, a GBS antigen may be present in more than one hybrid polypeptide and/or as a 
non-hybrid polypeptide. It is preferred, however, that an antigen is present either as a hybrid or as a 
non-hybrid, but not as both. 

Preferably, the GBS antigen in one of the hybrid polypeptides is GBS 80 or a fragment 
thereof. Accordingly, examples of two-antigen hybrids for use in the invention may comprise: (1) 

10 GBS 80 and GBS 91, (2) GBS 80 and GBS 104, (3) GBS 80 and GBS 147, (4) GBS 80 and GBS 
173, (5) GBS 80 and GBS 276, (6) GBS 80 and GBS 305, (7) GBS 80 and GBS 313, (8) GBS 80 and 
GBS 322, (9) GBS 80 and GBS 328, (10) GBS 80 and GBS 330, (11) GBS 80 and GBS 338, (12) 
GBS 80 and GBS 358, (13) GBS 80 and GBS 361, (14) GBS 80 and GBS 404, (14) GBS 80 and 
GBS 404, (15) GBS 80 and GBS 656, (16) GBS 80 and GBS 690, and (17) GBS 80 and GBS 691. 

15 Preferably, a two-antigen hybrid for use in the invention comprises GBS 80 and GBS 691 . 

Hybrid polypeptides can be represented by the formula NH 2 -A-{-X-L-} w -B-COOH, wherein: 
X is an amino acid sequence of a GBS antigen or a fragment thereof; L is an optional linker amino 
acid sequence; A is an optional N-terminal amino acid sequence; B is an optional C-terminal amino 
acid sequence; and n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15. 

20 If a -X- moiety has a leader peptide sequence in its wild-type form, this may be included or 

omitted in the hybrid protein. In some embodiments, the leader peptides will be deleted except for 
that of the -X- moiety located at the N-terminus of the hybrid protein i.e. the leader peptide of X { will 
be retained, but the leader peptides of X 2 ... X n will be omitted. This is equivalent to deleting all 
leader peptides and using the leader peptide of Xi as moiety -A-. 

25 For each n instances of {-X-L-}, linker amino acid sequence -L- may be present or absent. 

For instance, when -ti=2 the hybrid may be ^-XrL^-Lj-COOH, M^-Xj-Xs-COOH, 
NH 2 -Xi -Li -X2-COOH, NH2-X1-X2-L2-COOH, etc. Linker amino acid sequence(s) -L- will typically 
be short (e.g. 20 or fewer amino acids i.e. 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 
1). Examples comprise short peptide sequences which facilitate cloning, poly-glycine linkers (i.e. 

30 comprising Gly„ where n = 2, 3, 4, 5, 6, 7, 8, 9, 10 or more), and histidine tags (i.e. His„ where n = 3, 
4, 5, 6, 7, 8, 9, 10 or more). Other suitable linker amino acid sequences will be apparent to those 
skilled in the art. A useful linker is GSGGGG (SEQ ID 1), with the Gly-Ser dipeptide being formed 
from a BamBl restriction site, thus aiding cloning and manipulation, and the (Gly) 4 tetrapeptide 
being a typical poly-glycine linker. 
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-A- is an optional N-terminal amino acid sequence. This will typically be short (e.g. 40 or 
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include leader sequences to 
direct protein trafficking, or short peptide sequences which facilitate cloning or purification (e.g. 
histidine tags i.e. His„ where n = 3, 4, 5, 6, 7, 8, 9, 10 or more). Other suitable N-terminal amino acid 
sequences will be apparent to those skilled in the art. If Xi lacks its own N-terminus methionine, -A- 
is preferably an oligopeptide (e.g. with 1, 2, 3, 4, 5, 6, 7 or 8 amino acids) which provides a N- 
terminus methionine. 

-B- is an optional C-terminal amino acid sequence. This will typically be short (e.g. 40 or 
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include sequences to direct 
protein trafficking, short peptide sequences which facilitate cloning or purification (e.g. comprising 
histidine tags i.e. His„ where n = 3, 4, 5, 6, 7, 8, 9, 10 or more), or sequences which enhance protein 
stability. Other suitable C-terminal amino acid sequences will be apparent to those skilled in the art. 
Most preferably, n is 2 or 3. 

The saccharide antigen 

The saccharide antigen is generally the capsular polysaccharide of a GBS or a derivative 
thereof. Suitable derivatives include oligosaccharide (e.g. from 3 to 150, preferably 8 to 100, 
20 monosaccharide units) fragments of the polysaccharide (e.g. refs. 12 to 16), de-acetylated saccharides 
(Ref. 16), N-acroylated saccharides (16), saccharides with terminal aldehyde groups, etc. 

The saccharide is preferably conjugated to a carrier molecule to enhance immunogenicity 
(e.g. see refs. 4 to 23 etc.). In some embodiments of the invention the GBS saccharide is conjugated 
to a GBS protein as defined above, thereby giving a polypeptide/saccharide combination of the 
25 invention in a single molecule, hi other embodiments the GBS saccharide is conjugated to a non- 
GBS protein, in which case the conjugate will be combined with a separate GBS protein to give a 
polypeptide/saccharide combination of the invention. 

Non-GBS carrier polypeptides include tetanus toxoid, the N. meningitidis outer membrane 
protein (24), synthetic peptides (25, 26), heat shock proteins (27, 28), pertussis proteins (29, 30), 
30 protein D from H. influenzae (31), cytokines (32), lymphokines (32), hormones (32), growth factors 
(32), toxin A or B from C.difficile (33), iron-uptake proteins (34) etc. Preferred carrier proteins are 
the CRM197 diphtheria toxoid (35) and tetanus toxoid. 

The saccharide and polypeptide are joined covalently. This may involve a direct covalent 
bond between the saccharide and polypeptide, or indirect coupling via a linker or spacer may be used 
35 (e.g. via a B-propionamido linker (16), etc.). Any suitable conjugation chemistry may be used (e.g 
reductive animation (21) etc.). Linkage is preferably via a terminal saccharide in the polysaccharide. 

16 
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A single carrier molecule may carry saccharide antigens of a single type (e.g. saccharides 
derived from a single GBS serotype) or may carry multiple different antigens (e.g. saccharides 
derived from multiple GBS serotypes, all conjugated to the same carrier). 

The saccharides can, of course, be prepared by various means (e.g. purification of the 
5 saccharide from GBS, chemical synthesis, etc.), in various sizes (e.g. full-length, fragmented, etc.) 
and may be derivatised for linking to carriers. They are preferably prepared in substantially pure 
form (i.e. substantially free from other streptococcal saccharides) or substantially isolated form. 
Processes for preparing capsular polysaccharides from GBS are well known in the art (e.g. refs. 36 to 
39) and processes for preparing oligosaccharides from polysaccharides are also known (e.g. 
10 hydrolysis, sonication, enzymatic treatment, treatment with a base followed by nitrosation, etc. (12 to 
16)). 

As an alternative to using a saccharide antigen in non-conjugated combinations, a peptide 
mimetic of the GBS capsular polysaccharide may be used (e.g. 40). Suitable peptides can be selected 
by techniques such as phage display using protective anti-saccharide antibodies. As a further 
1 5 alternative, an anti-idiotypic antibody may be used instead of a saccharide antigen (e.g. ref. 41). 

Prime/boost schedules 

Polypeptide/saccharide combinations of the invention may be given as single doses or as part 
of a prime/boost schedule. In a prime/boost schedule, the combinations may be used as the priming 

20 dose, the boosting dose(s), or both. 

If a combination is used for both priming and boosting, it is preferred to use the same 
combination both times. If a combination is used for only one of priming and boosting, it is 
preferred that the other dose should use the polypeptide or saccharide on which the combination is 
based. Thus the invention provides a prime-boost schedule where either (i) one of the saccharide and 

25 polypeptide antigens is used for priming an immune response and a combination are used for 

boosting the response, or (ii) combined saccharide and polypeptide antigens are used for priming an 
immune response but only one is used for boosting the response. 

Various timings for priming and boosting are suitable for use with the invention. In one 
embodiment, a priming dose is given to a child and a booster is given to a teenager (13-18 years) or 

30 young adult (19-25 years), hi another embodiment, a priming dose is given to a teenager or young 
adult and a booster is given during pregnancy. In another embodiment, a priming dose is given to a 
female who intends to become pregnant and a booster is given during pregnancy. 

Immunogenic pharmaceutical compositions 
35 Polypeptide/saccharide combinations are formulated as immunogenic compositions, and 

more preferably as compositions suitable for use as a vaccine in humans (e.g. children or adults). 
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Vaccines of the invention may either be prophylactic {i.e. to prevent infection) or therapeutic (i.e. to 
treat disease after infection), but will typically be prophylactic. Accordingly, the invention includes a 
method for the therapeutic or prophylactic treatment of GBS infection in an animal susceptible to 
GBS infection comprising administering to said animal a therapeutic or prophylactic amount of the 
5 immunogenic compositions of the invention. 

The composition of the invention is preferably sterile. 

The composition of the invention is preferably pyrogen-free. 

The composition of the invention generally has a pH of between 6.0 and 7.0, more preferably 
to between 6.3 and 6.9 e.g. 6.6±0.2. The composition is preferably buffered at this pH. 
10 Other components suitable for human administration are disclosed in reference 42. 

Vaccines of the invention may be administered in conjunction with other immunoregulatory 
agents. In particular, compositions will usually include an adjuvant. Preferred further adjuvants 
include, but are not limited to, one or more of the following set forth below: 

A. Mineral Containing Compositions 

15 Mineral containing compositions suitable for use as adjuvants in the invention include 

mineral salts, such as aluminium salts and calcium salts. The invention includes mineral salts such as 
hydroxides (e.g. oxyhydroxides), phosphates (e.g. hydroxyphoshpates, orthophosphates), sulphates, 
etc. {e.g. see chapters 8 & 9 of ref. 43}), or mixtures of different mineral compounds, with the 
compounds taking any suitable form (e.g. gel, crystalline, amorphous, etc.), and with adsorption 

20 being preferred. The mineral containing compositions may also be formulated as a particle of metal 
salt. See ref. 44. 

B. Oil-Emulsions 

Oil-emulsion compositions suitable for use as adjuvants in the invention include squalene- 
water emulsions, such as MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated into 
25 submicron particles using a microfluidizer). See ref. 45. 

Complete Freund's adjuvant (CFA) and incomplete Freund's adjuvant (IF A) may also be 
used as adjuvants in the invention. 

C. Saponin Formulations 

Saponin formulations, may also be used as adjuvants in the invention. Saponins are a 
30 heterologous group of sterol glycosides and triterpenoid glycosides that are found in the bark, leaves, 
stems, roots and even flowers of a wide range of plant species. Saponin from the bark of the Quillaia 
saponaria Molina tree have been widely studied as adjuvants. Saponin can also be commercially 
obtained from Smilax ornata (sarsaprilla), Gypsophilla paniculata (brides veil), and Saponaria 
officianalis (soap root). Saponin adjuvant formulations include purified formulations, such as QS21, 
35 as well as lipid formulations, such as ISCOMs. 

Saponin compositions have been purified using High Performance Thin Layer 
Chromatography (HP-LC) and Reversed Phase High Performance Liquid Chromatography (RP~ 
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HPLC). Specific purified fractions using these techniques have been identified, including QS7, 
QS17, QS18, QS21, QH-A, QH-B and QH-C. Preferably, the saponin is QS21. A method of 
production of QS21 is disclosed in U.S. Patent No. 5,057,540. Saponin formulations may also 
comprise a sterol, such as cholesterol (see WO 96/33739). 
5 Combinations of saponins and cholesterols can be used to form unique particles called 

Immunostimulating Complexs (ISCOMs). ISCOMs typically also include a phospholipid such as 
phosphatidylethanolamine or phosphatidylcholine. Any known saponin can be used in ISCOMs. 
Preferably, the ISCOM includes one or more of Quil A, QHA and QHC. ISCOMs are further 
described in EP 0 109 942, WO 96/11711 and WO 96/33739. Optionally, the ISCOMS may be 
10 devoid of additional detergent. Seeref. 46. 

A review of the development of saponin based adjuvants can be found at ref. 47. 

C. Virosomes and Virus Like Particles (VLPs) 

Virosomes and Virus Like Particles (VLPs) can also be used as adjuvants in the invention. 
These structures generally contain one or more proteins from a virus optionally combined or 

15 formulated with a phospholipid. They are generally non-pathogenic, non-replicating and generally 
do not contain any of the native viral genome. The viral proteins may be recombinantly produced or 
isolated from whole viruses. These viral proteins suitable for use in virosomes or VLPs include 
proteins derived from influenza virus (such as HA or NA), Hepatitis B virus (such as core or capsid 
proteins), Hepatitis E virus, measles virus, Sindbis virus, Rotavirus, Foot-and-Mouth Disease virus, 

20 Retrovirus, Norwalk virus, human Papilloma virus, HIV, RNA-phages, QB-phage (such as coat 
proteins), GA-phage, fr-phage, AP205 phage, and Ty (such as retrotransposon Ty protein pi). VLPs 
are discussed further in WO 03/024480, WO 03/024481, and Refs. 48, 49, 50 and 51. Virosomes are 
discussed further in, for example, Ref. 52 

D. Bacterial or Microbial Derivatives 

25 Adjuvants suitable for use in the invention include bacterial or microbial derivatives such as: 

(1) Non-toxic derivatives of enterobacterial lipopoly saccharide (LPS) 

Such derivatives include Monophosphoryl lipid A (MPL) and 3-O-deacylated MPL 
(3dMPL). 3dMPL is a mixture of 3 De-O-acylated monophosphoryl lipid A with 4, 5 or 6 acylated 
chains. A preferred "small particle" form of 3 De-O-acylated monophosphoryl lipid A is disclosed in 
30 EP 0 689 454. Such "small particles" of 3dMPL are small enough to be sterile filtered through a 
0.22 micron membrane (see EP 0 689 454). Other non-toxic LPS derivatives include 
monophosphoryl lipid A mimics, such as aminoalkyl glucosaminide phosphate derivatives e.g. 
RC-529. See Ref. 53. 

(2) Lipid A Derivatives 

35 Lipid A derivatives include derivatives of lipid A from Escherichia coli such as OM-174. 

OM-174 is described for example in Ref. 54 and 55. 

(3) Immunostimulatory oligonucleotides 
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Immunostimulatory oligonucleotides suitable for use as adjuvants in the invention include 
nucleotide sequences containing a CpG motif (a sequence containing an unmethylated cytosine 
followed by guanosine and linked by a phosphate bond). Bacterial double stranded RNA or 
oligonucleotides containing palindromic or poly(dG) sequences have also been shown to be 
5 immunostimulatory. 

The CpG's can include nucleotide modifications/analogs such as phosphorothioate 
modifications and can be double-stranded or single-stranded. Optionally, the guanosine may be 
replaced with an analog such as 2 5 -deoxy-7 -deazaguanosine . See ref. 56, WO 02/26757 and WO 
99/62923 for examples of possible analog substitutions/The adjuvant effect of CpG oligonucleotides 

10 is further discussed in Refs. 57, 58, WO 98/40100, U.S. Patent No. 6,207,646, U.S. Patent No. 
6,239,1 16, and U.S. Patent No. 6,429,199. 

The CpG sequence may be directed to TLR9, such as the motif GTCGTT or TTCGTT. See 
ref. 59. The CpG sequence may be specific for inducing a Thl immune response, such as a CpG-A 
ODN, or it may be more specific for inducing a B cell response, such a CpG-B ODN. CpG-A and 

15 CpG-B ODNs are discussed in refs. 60, 61 and WO 01/95935. Preferably, the CpG is a CpG-A ODN. 
Preferably, the CpG oligonucleotide is constructed so that the 5' end is accessible for receptor 
recognition. Optionally, two CpG oligonucleotide sequences may be attached at their 3' ends to form 
"immunomers". See, for example, refs. 62, 63, 64 and WO 03/035836. 

(4) ADP-ribosylating toxins and detoxified derivatives thereof. 

20 Bacterial ADP-ribosylating toxins and detoxified derivatives thereof may be used as 

adjuvants in the invention. Preferably, the protein is derived from E. coli (i.e., E. coli heat labile 
enterotoxin "LT), cholera ("CT"), or pertussis ("PT"). The use of detoxified ADP-ribosylating 
toxins as mucosal adjuvants is described in WO 95/17211 and as parenteral adjuvants in WO 
98/42375. Preferably, the adjuvant is a detoxified LT mutant such as LT-K63, LT-R72, and 

25 LTR192G. The use of ADP-ribosylating toxins and detoxified derivaties thereof, particularly LT- 
K63 and LT-R72, as adjuvants can be found in Refs. 65, 66, 67, 68, 69, 70, 71 and 72 each of which 
is specifically incorporated by reference herein in their entirety. Numerical reference for amino acid 
substitutions is preferably based on the alignments of the A and B subunits of ADP-ribosylating 
toxins set forth in Domenighini et al., Mol. Microbiol (1995) 15(6):1165 - 1167, specifically 

30 incorporated herein by reference in its entirety. 
E. Human Immunomodulators 

Human immunomodulators suitable for use as adjuvants in the invention include cytokines, 
such as interleukins (e.g. JL-l, IL-2, IL-4, IL-5, IL-6, IL-7, H.-12, etc.), interferons (e.g. interferon- 
7), macrophage colony stimulating factor, and tumor necrosis factor. 

35 F. Bioadhesives and Mucoadhesives 

Bioadhesives and mucoadhesives may also be used as adjuvants in the invention. Suitable 
bioadhesives include esterified hyaluronic acid microspheres (Ref. 73) or mucoadhesives such as 
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cross-linked derivatives of poly(acrylic acid), polyvinyl alcohol, polyvinyl pyrollidone, 
polysaccharides and carboxymethylcellulose. Chitosan and derivatives thereof may also be used as 
adjuvants in the invention. E.g., ref. 74. 

G. Microparticles 

5 Microparticles may also be used as adjuvants in the invention. Microparticles (i.e. a particle 

of ~100nm to ~150/mi in diameter, more preferably ~200nm to ~30/xm in diameter, and most 
preferably ~500nm to ~10/zm in diameter) formed from materials that are biodegradable and 
non-toxic (e.g. a poly(a-hydroxy acid), a polyhydroxybutyric acid, a polyorthoester, a polyanhydride, 
a polycaprolactone, etc.), with poly(lactide-co-glycolide) are preferred, optionally treated to have a 
10 negatively-charged surface (e.g. with SDS) or a positively-charged surface (e.g. with a cationic 
detergent, such as CTAB). 

H. Liposomes 

Examples of liposome formulations suitable for use as adjuvants are described in U.S. Patent 
No. 6,090,406, U.S. Patent No. 5,916,588, and EP 0 626 169. 
15 I. Polvoxvethvlene ether and Polvoxvethvlene Ester Formulations 

Adjuvants suitable for use in the invention include polyoxyethylene ethers and 
polyoxyethylene esters. Ref. 75. Such formulations further include polyoxyethylene sorbitan ester 
surfactants in combination with ^n octoxynol (Ref. 76) as well as polyoxyethylene alkyl ethers or 
ester surfactants in combination with at least one additional non-ionic surfactant such as an octoxynol 
. 20 (Ref. 77). 

Preferred polyoxyethylene ethers are selected from the following group: polyoxyethylene-9-lauryl 
ether (laureth 9), polyoxyethylene-9-steoryl ether, polyoxytheylene-8-steoryl ether, polyoxyethylene- 
4-lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether. 

J. Polvphosnhazene (PCPP) 

25 PCPP formulations are described, for example, in Ref. 78 and 79. 

K. Muramvl peptides 

' Examples of muramyl peptides suitable for use as adjuvants in the invention include N- 

acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D-isoglutamine 
(nor-MDP), and N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(l ! -2'-dipalmitoyl-^-glycero-3- 
30 hydroxyphosphoryloxy)-ethylamine MTP-PE). 
L. Imidazoquinolone Compounds . 

Examples of imidazoquinolone compounds suitable for use adjuvants in the invention include 
Imiquamod and its homologues, described further in Ref. 80 and 81. 

The invention may also comprise combinations of aspects of one or more of the adjuvants identified 
35 above. For example, the following adjuvant compositions may be used in the invention: 
(1) a saponin and an oil-in-water emulsion (ref. 82); 
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(2) a saponin (e.g.., QS21) + a non-toxic LPS derivative (e.g., 3dMPL) (see WO 
94/00153); 

(3) a saponin (e.g.., QS21) + a non-toxic LPS derivative (e.g., 3dMPL) + a cholesterol; 

(4) a saponin (e.g. QS21) + 3dMPL + IL-12 (optionally + a sterol) (Ref. 83); 

5 combinations of 3dMPL with, for example, QS21 and/or oil-in-water emulsions (Ref. 84); 

(5) SAF, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-block polymer L121, 
and thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion. 

(6) Ribi™ adjuvant system (RAS), (Ribi Immunochem) containing 2% Squalene, 0.2% 
10 Tween 80, and one or more bacterial cell wall components from the group consisting of 

monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), 
preferably MPL + CWS (Detox™); and 

(7) one or more mineral salts (such as an aluminum salt) + a non-toxic derivative of LPS 
(such as 3dPML). 

15 Aluminium salts and MF59 are preferred adjuvants for parenteral immunisation. Mutant 

bacterial toxins are preferred mucosal adjuvants. 

The composition may include an antibiotic. 

GBS polypeptide(s) and saccharide(s) in the compositions of the invention will be present in 
'immunologically effective amounts' i.e. the administration of that amount to an individual, either in 

20 a single dose or as part of a series, is effective for treatment or prevention of disease. This amount 
varies depending upon the health and physical condition of the individual to be treated, age, the 
taxonomic group of individual to be treated (e.g. non-human primate, primate, etc.), the capacity of 
the individual's immune system to synthesise antibodies, the degree of protection desired, the 
formulation of the vaccine, the treating doctor's assessment of the medical situation, and other 

25 relevant factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

Typically, the compositions of the invention are prepared as injectables. Direct delivery of 
the compositions will generally be parenteral (e.g. by injection, either subcutaneously, 
intraperitoneally, intravenously or intramuscularly or delivered to the interstitial space of a tissue) or 

30 mucosal (e.g. oral or intranasal [85,86]). The compositions can also be administered into a lesion. 
The invention provides a syringe containing a composition of the invention. 

Once formulated, the compositions of the invention can be administered directly to the 
subject. The subjects to be treated can be animals; in particular, human subjects can be treated. The 
vaccines are particularly useful for vaccinating children and teenagers, and more particularly 

35 females. 
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As well as GBS polypeptides and saccahrides, the composition of the invention may 
comprise further antigens. For example, the composition may comprise one or more of the following 
further antigens: 

- antigens from Helicobacter pylori such as CagA [87 to 90], VacA [91, 92], NAP [93, 94, 95], 
5 HopX [e.g. 96], HopY [e.g. 96] and/or urease. 

_ a saccharide antigen from N. meningitidis serogroup A, C, W135 and/or Y, such as the 
oligosaccharide disclosed in ref. 97 from serogroup C [see also ref 98] or the 
oligosaccharides of ref. 99. 

- a saccharide antigen from Streptococcus pneumoniae [e.g. 100, 101, 102]. 
10 - an antigen from hepatitis A virus, such as inactivated virus [e.g. 103, 104]. 

- an antigen from hepatitis B virus, such as the surface and/or core antigens [e.g. 104, 105]. 

- an antigen from Bordetella pertussis, such as pertussis holotoxin (PT) and filamentous 
haemagglutinin (FHA) from B.pertussis, optionally also in combination with.pertactin and/or 
agglutinogens 2 and 3 [e.g. refs. 106 & 107]. 

15 - a diphtheria antigen, such as a diphtheria toxoid [e.g. chapter 3 of ref. 108] e.g. the CRM197 
mutant [e.g. 109]. 

- a tetanus antigen, such as a tetanus toxoid [e.g. chapter 4 of ref. 128]. 

- a saccharide antigen from Haemophilus influenzae B [e.g. 98]. 

- an antigen from hepatitis C virus [e.g. 110]. 

20 - an antigen fromN. gonorrhoeae [e.g. Ill, 112, 113, 114]. 

- an antigen from Chlamydia pneumoniae [e.g. refs. 115 to 121]. 

- an antigen from Chlamydia trachomatis [e.g. 122]. 

- an antigen from Porphyromonas gingivalis [e.g. 123]. 

- polio antigen(s) [e.g. 124, 125] such as OPV or, preferably, IPV. 

25 - rabies antigen(s) [e.g. 126] such as lyophilised inactivated virus [e.g. 127, RabAvert™]. 

- measles, mumps and/or rubella antigens [e.g. chapters 9, 10 & 1 1 of ref. 128]. 

- influenza antigen(s) [e.g. chapter 19 of ref. 128], such as the haemagglutinin and/or 
neuraminidase surface proteins. 

- an antigen from Moraxella catarrhalis [e.g. 129]. 

30 - an antigen from Streptococcus pyogenes (group A streptococcus) [e.g. 3,130, 131]. 

- an antigen from Staphylococcus aureus [e.g. 1 32]. 

- an antigen from Bacillus anthracis [e.g. 133, 134, 135]. 

- an antigen from a virus in the flaviviridae family (genus flavivirus), such as from yellow 
fever virus, Japanese encephalitis virus, four serotypes of Dengue viruses, tick-borne 

35 encephalitis virus, West Nile virus. 



23 



WO 2004/041157 



PCT/US2003/029167 



- a pestivirus antigen, such as from classical porcine fever virus, bovine viral diarrhoea virus, 
and/or border disease virus . 

- a parvovirus antigen e.g. from parvovirus B19. 

- a prion protein (e.g. the CJD prion protein) 

5 - an amyloid protein, such as a beta peptide [136] 

- a cancer antigen, such as those listed in Table 1 of ref. 137 or in tables 3 & 4 of ref. 138. 

The composition may comprise one or more of these further antigens. 

Toxic protein antigens may be detoxified where necessary (e.g. detoxification of pertussis 
toxin by chemical and/or genetic means [107]). 
10 Where a diphtheria antigen is included in the composition it is preferred also to include 

tetanus antigen and pertussis antigens. Similarly, where a tetanus antigen is included it is preferred 
also to include diphtheria and pertussis antigens. Similarly, where a pertussis antigen is included it is 
preferred also to include diphtheria and tetanus antigens. DTP combinations are thus preferred. 
Saccharide antigens are preferably in the form of conjugates. Carrier proteins for the conjugates are 
15 the same as those described above for GBS saccharide conjugation, with CRM197 being preferred. 

Antigens in the composition will typically be present at a concentration of at least l]Lig/ml 
each. In general, the concentration of any given antigen will be sufficient to elicit an immune 
response against that antigen. 

As an alternative to using protein antigens in the composition of the invention, nucleic acid 
20 encoding the antigen may be used. Protein components of the compositions of the invention may 
thus be replaced by nucleic acid (preferably DNA e.g. in the form of a plasmid) that encodes the 
protein. 

Methods of treating patients 

The invention provides polypeptide/saccharide combinations of the invention for use as 
25 medicaments.. The medicament is preferably able to raise an immune response in a mammal (i.e. it is 
an immunogenic composition) and is more preferably a vaccine. 

The invention also provides a method of raising an immune response in a patient, comprising 
administering to a patient a composition of the invention. The immune response is preferably 
protective against streptococcal disease, and may comprise a humoral immune response and/or a 
30 cellular immune response. 

The invention also provides the use of polypeptide/saccharide combination of the invention in 
the manufacture of a medicament for raising an immune response in an patient. The medicament is 
preferably an immunogenic composition (e.g. a vaccine). The medicament is preferably for the 
prevention and/or treatment of a disease caused by GBS (e.g. meningitis, sepsis, chorioamnionitis). 
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The invention also provides for a kit comprising a first component comprising the 
immunogenic compositions of the invention. The kit may further include a second component 
comprising one or more of the following: instructions, syringe or other delivery device, adjuvant, or 
pharmaceutically acceptable formulating solution. 
5 The invention also provides a delivery device pre-filled with the immunogenic compositions 

of the invention. 

The invention also provides a method for raising an immune response in a mammal 
comprising the step of administering an effective amount of a composition of the invention. The 
immune response is preferably protective and preferably involves antibodies and/or cell-mediated 
10 immunity. The method may raise a booster response. 
Process for manufacturing 

The invention provides a process for preparing a composition of the invention, comprising 
the step of mixing (i) one or more GBS polypeptide antigens with (ii) one or more GBS saccharide 
antigens. 

15 The process may comprise the step of covalently linking the GBS polypeptide to the GBS 

saccharide in order to form a conjugate. 
Definitions 

The term "comprising" means "including" as well as "consisting" e.g. a composition 

"comprising" X may consist exclusively of X or may include something additional e.g. X + Y. 
20 The term "about" in relation to a numerical value x means, for example, x±\0%. 

The word "substantially" does not exclude "completely" e.g. a composition which is 

"substantially free" from Y may be completely free from Y. Where necessary, the word 

"substantially" may be omitted from the definition of the invention. 

MODES FOR CARRYING OUT THE INVENTION 
25 GBS serotype III is grown in Todd-Hewitt broth as described in reference 36 and its capsular 

polysaccharide was purified. The polysaccharide is depolymerised, sized and purified as described in 

reference 14 to give oligosaccharide antigen. Similar procedures are used to prepare capsular 

polysaccharides from other GBS serotypes. 

The oligosaccharide is either admixed with or covalently conjugated (directly or via a linker) 
30 to purified serotype V protein. Preferably, the protein comprises a GBS antigen or a fragment 

thereof selected from the group consisting of GBS 80, GBS 91, GBS 104, GBS 147, GBS 173, GBS 

276, GBS 305, GBS 313, GBS 322, GBS 328, GBS 330, GBS 338, GBS 358, GBS 361, GBS 404, 

GBS 656, GBS 690, and GBS 691. 



25 



WO 2004/041157 



PCT/US2003/029167 



It will be understood that the invention has been described by way of example only and 
modifications may be made whilst remaining within the scope and spirit of the invention. All 
documents cited herein are incorporated by reference in their entirety. 
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CLAIMS: 

1. An immunogenic composition comprising a GBS saccharide antigen and at least 
two GBS polypeptide antigens, wherein said GBS saccharide antigen comprises a 
saccharide selected from GBS serotype la, lb, and III, and wherein said GBS polypeptide 
antigens comprise a combination of at least two polypeptide or fragments thereof selected 
from the antigen group consisting of GBS 80, GBS 91, GBS 104, GBS 147, GBS 173, 
GBS 276, GBS 305, GBS 313, GBS 322, GBS 328, GBS 330, GBS 338, GBS 358, GBS 
361, GBS 404, GBS 656, GBS 690, and GBS 691. 

2. The immunogenic composition of claim 1, wherein said GBS polypeptide 
antigens further comprise a GBS polypeptide or a fragment thereof of serogroup II. 

3. The immunogenic composition of claim 1, wherein said GBS polypeptide antigen 
combination comprises GBS 80 or a fragment thereof. 

4. The immunogenic composition of claim 3, wherein said GBS polypeptide 
antigens comprise a combination of two GBS antigens or fragments thereof selected from 
the group consisting of (1) GBS 80 and GBS 91, (2) GBS 80 and GBS 104, (3) GBS 80 
and GBS 147, (4) GBS 80 and GBS 173, (5) GBS 80 and GBS 276, (6) GBS 80 and GBS 
305, (7) GBS 80 and GBS 313, (8) GBS 80 and GBS 322, (9) GBS 80 and GBS 328, (10) 
GBS 80 and GBS 330, (1 1) GBS 80 and GBS 338, (12) GBS 80 and GBS 358, (13) GBS 
80 and GBS 361, (14) GBS 80 and GBS 404, (14) GBS 80 and GBS 404, (15) GBS 80 
and GBS 656, (16) GBS 80 and GBS 690, and (17) GBS 80 and GBS 691. 

5. The immunogenic composition of claim 4, wherein said combination is selected 
from the group consisting of (1) GBS 80 and GBS 338; (2) GBS 80 and GBS 361, (3) 
GBS 80 and GBS 305, (4) GBS 80 and GBS 328, (5) GBS 80 and GBS 690, (6) GBS 80 
and GBS 691 and (7) GBS 80 and GBS 147. 
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6. The immunogenic composition of claim 4, wherein said combination comprises 
GBS 80 and GBS 691. 

7. The immunogenic composition of claim 1, wherein said composition comprises a 
combination of at least three GBS polypeptide antigens. 

8. The immunogenic composition of claim 7, wherein said combination comprises 
GBS 80 and GBS 691. 

9. The immunogenic composition of claim 7, wherein said combination comprises 
GBS 80. 

10. The immunogenic composition of claim 1, wherein at least one GBS polypeptide 
antigen is covalently linked to the GBS saccharide antigen. 

11. The immunogenic composition of claim 1, wherein said GBS saccharide antigen 
is covalently linked to a carrier protein. 

12. The immunogenic composition of claim 1 1, wherein said carrier protein is 
selected from the group consisting of tetanus toxoid, diphtheria toxoid, N. meningitides 
outer membrane protein, heat shock protein, pertusis protein, protein D from H. 
influenzae, and toxin A or B from C. difficile. 

13. The immunogenic composition of claim 12, wherein said carrier protein is 
selected from the group consisting of tetanus toxoid and diphtheria toxoid. 

14. The immunogenic composition of claim 13, wherein said carrier protein is a 
diphtheria toxoid. 

15. The immunogenic composition of claim 14, wherein said diphtheria toxoid is 
CRM197. 
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16. A method for the therapeutic or prophylactic treatment of GBS infection in an 
animal susceptible to GBS infection comprising administering to said animal a 
therapeutic or prophylactic amount of the immunogenic composition of claim 1. 

17. A method for the manufacture of a medicament for raising an immune response 
against GBS comprising combining a GBS saccharide antigen and at least two GBS 
polypeptide antigens, wherein said GBS saccharide antigen comprises a saccharide 
selected from GBS serotype la, lb, and III, and wherein said GBS polypeptide antigens 
comprise a combination of at least two polypeptide or fragments thereof selected from 
the antigen group consisting of GBS 80, GBS 91, GBS 104, GBS 147, GBS 173, GBS 
276, GBS 305, GBS 313, GBS 322, GBS 328, GBS 330, GBS 338, GBS 358, GBS 361, 
GBS 404, GBS 656, GBS 690, and GBS 691. 
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SEQUENCE LISTING 

SEQ ID NO. 1 

ATGAAATTATCGAAGAAGTTATTGTTTTCGGCTGCTGTTTTAACAATGGTGGCGGGGTCAACTGTTGAACCAGTAGCTCAGTTTGC 
GACTGGAATGAGTATTGTAAGAGCTGCAGAAGTGTCACAAGAACGCCCAGCGAAAACAACAGTAAATATCTATAAATTACAAGCTG 
ATAGTTATAAATCGGAAATTACTTCTAATGGTGGTATCGAGAATAAAGACGGCGAAGTAATATCTAACTATGCTAAACTTGGTGAC 
AATGTAAAAGGTTTGCAAGGTGTACAGTTTAAACGTTATAAAGTCAAGACGGATATTTCTGTTGATGAATTGAAAAAATTGACAAC 
AGTTGAAGCAGCAGATGCAAAAGTTGGAACGATTCTTGAAGAAGGTGTCAGTCTACCTCAAAAAACTAATGCTCAAGGTTTGGTCG 
T CGATGCT CTGGAT T CAAAAAGTAATGTGAGATAC T TGTATGTAGAAGAT T TAAAGAATT G AC CT T CAAACATTAC CAAAGCTTAT 
GCTGTACCGTTTGTGTTGGAATTACCAGTTGCTAACTCTACAGGTACAGGTTTCCTTTCTGAAATTAATATTTACCCTAAAAACGT 
TGTAACTGATGAACCAAAftACAGATAAAGATGTTAAAAAATTAGGTCAGGACGATGCAGGTTATACGATTGGTGAAGAATTCAAAT 
GGTTCTTGAAATCTACAATCCCTGCCAATTTAGGTGACTATGAAAAATTTGAAATTACTGATAAATTTGCAGATGGCTTGACTTAT 
AAAT CTGT TGGAAAAAT C AAGATTGGTTCGAAAACACTGAATAGAGATGAGCAC TACACTAT TGATGAAC CAACAGTTGATAAC C A 
AAATAC ATTAAAAATTACGT TTAAAC CAGAGAAATTTAAAGAAATTGCTGAGCTACTTAAAGGAATGACCCT TGTTAAAAAT C AAG 
ATGCTCTTGATAAAGCTACTGCAAATACAGATGATGCGGCATTTTTGGAAATTCCAGTTGCATCAACTATTAATGAAAAAGCAGTT 
T TAGGAAAAGCAAT TGAAAATACTTTTGAACTTCAATATGAC CATACTC CTGATAAAGCTGAC AAT C CAAAACCAT CTAAT C CTC C 
AAGAAAACCAGAAGTTCATACTGGTGGGAAACGATTTGTAAAGAAAGACTCAACAGAAACACAAACACTAGGTGGTGCTGAGTTTG 
ATTTGTTGGCTTCTGATGGGACAGCAGTAAAATGGACAGATGCTCTTATTAAAGCGAATACTAATAAAAACTATATTGCTGGAGAA 
GC TGTTAC TGGGC AAC CAAT CAAATT GAAAT CAC ATACAGAC GGTACGTTTGAGATTAAAGGTT TGGCTTATGC AGTTGATGCGAA 
TGCAGAGGGTAC AGCAGTAACTTACAAATTAAAAGAAACAAAAGCACCAGAAGGTTATGTAAT CC C TGAT AAAGAAA.T CGAGT TTA 
C AGTAT CAC AAACAT C T TATAATACAAAACC AACTGACAT CACGGT TGATAGTGCTGATGCAACAC CTGAT ACAATTAAAAAC AAC 
AAACGTCCTTCAATCCCTAATACTGGTGGTATTGGTACGGCTATCTTTGTCGCTATCGGTGCTGCGGTGATGGCTTTTGCTGTTAA 
GGGGATGAAGCGTCGTACAAAAGATAAC 

SEQ ID NO: 2 

MKX*SK2CLLFSAAVLTMVAGSTVEPVAQ 

NVKGLQGVQFKRYKVKTDISVDELKKLTWEAADAKVGTILEEGVSLPQKTNAQGLVVDALDSKSNVRYL 
AVPFVIjELPVANSTGTGFLSEINIYPKNWTDEPKTDKDVKKLGQDDAGYTIGEEFKWFIiKSTIPANL 

KS VGKI KI GS KTLNRDEHYT IDEPTVDNQNTLKI TFKPEKFKE I AELLKGMTLVKNQDALDKATANTDDAAFLEIPVASTINEKAV 
LGKAI ENTFELQYDHTPDKADNPKPSNPPRKPEVHTGGKRFVKKDSTETQTLGGAE FDLLASDGTAVKWTDALI KANTNKNYI AGE 
AVTGQP I KLKSHTDGTFE I KGIAYAVDANAEGTAVT YKLKETKAPEGYVI PDKE I EFTVSQTSYNTKPTDITVDSADATPDTIKNN 
KRPS I PNTGGI GTAI FVAI GAAVMAFAVKGMKRRTKDN 

SEQ ID NO. 3 

ATGAAAAAAGGAC AAGTAAAT GATACTAAGC AAT CTTAC T CT C TACGT AAAT ATAAATT TGGT TTAGCAT CAGTAATTT TAGGGT C 
ATT CATAATGGT C AC AAGT C C TGT TTTTGCGGAT C AAA.CTACAT CGGTT C AAGTT AATAAT C AGACAGGCAC TAGTGTGGATG CTA 
ATAATT CTT C C AATGAGACAAGTG CGT C AAGTGTGATTAC TT C C AAT AAT GAT AGTGTT CAAGCGT CTGAT AAAGT TGTAAATAGT 
C AAAAT ACGGCAAC AAAGGAC AT TACTACT C CTTTAGTAGAGACAAAGC C AATGGTGGAAAAAAC AT TAC CTG AACAAGGGAATT A 
TGTT TATAGC AAAGAAACCGAGGTGAAAAAT ACAC CT T CAAAAT CAGCCC CAGTAGCTT T CTATGCAAAGAAAGGTGAT AAAGT TT 
T CTATGAC CAAGT AT T TAATAAAGATAATGTGAAATGGATT T CATATAAGT CTTTTTGTGGCGTACGT CGATACGC AGCTATTGAG 
T C ACTAGAT C C ATC AGGAGGTT C AGAGAC TAAAGCAC CTACT C CTGTAACAAAT TCAGGAAGCAATAATCAAGAGAAAATAGCAAC 
GCAAGGAAATTATACATTTTCACATAAAGTAGAAGTAAAAAATGAAGCTAAGGTAGCGAGTCCAACTCAATTTACATTGGACAAAG 
GAGAC AGAAT TTTTT ACGAC CAAATACTAAC TATTGAAGGAAAT CAGTGGTTAT CTTATAAAT CATTCAATGGTGTT CGTCGTTT T 
GTTTTGCTAGGTAAAG C AT CTT C AGTAGAAAAAAC TGAAGATAAAGAAAAAGTGT CTC CT CAAC CACAAGC CCGTATTACTAAAAC 
TGGTAGAC TGACTATTT CTAAC GAAAC AACTACAGGTTTTGATATT TTAAT TACGAATAT TAAAGATGAT AACGGTAT CGCTGCTG 
TTAAGGTACCGGTTTGGACTGAACAAGGAGGGCAAGATGATATTAAATGGTATACAGCTGTAACTACTGGGGATGGCAACTACAAA 
GTAGCTGTATCATTTGCTGACCATAAGAATGAGAAGGGTCTTTATAATATTCATTTATACTACCAAGAAGCTAGTGGGACACTTGT 
AGGTGTAACAGGAACTAAAGTGACAGTAGCTGGAACTAATTCTTCTCAAGAACCTATTGAAAATGGTTTAGCAAAGACTGGTGTTT 
ATAATATTATCGGAAGTACTGAAGTAAAAAATGAAGCTAAAATATCAAGTCAGACCCAATTTACTTTAGAAAAAGGTGACAAAATA 
AATTATGATC^GTATTGACAGCAGATGGTTACCAGTGGATTTCTTACAAATCTTATAGTGGTGTTCGTCGCTATATTCCTGTGAA 
AAAGCTAACTAC^GTAGTGAAAAAGCGAAAGATGAGGCGACTAAACCGACTAGTTATCCCAACTTACCTAAAACAGGTACCTATA 
CATTTACTAAAACTGTAGATGTGAAAAGTGAACCTAAAGTATCAAGTCCAGTGG 

TATGATCAAGTGTTAGTAGTAGATGGTCATCAGTGGATTTCATACAAGAGTTATTCCGGTATTCGTCGCTATATTGAAATT 

SEQ ID NO. 4 

MKKGQVITOTKQSYSLRKYKFGIjASVILGSFI 

QNTATKDI TTPLVETKPMVEKTLPEQG3STYVYSKE S YKS FCGVRRYAAIE 

SLDPSGGSETKAPTPVTNSGSNNQEKIATQGNYTFSHKVEVKNEAKVASPTQFTLDKGDRIFYDQILTIEGNQWLSYKSFNGVRRF 
VLLGKASSVEKTEDKEKVS PQPQARI TKTGRLTI SNETTTGFDILITNI KDDNGIAAVKVPVWTEQGGQDDI KWYTAVTTGDGNYK 
VAVS FADHKI^KGL YNIHLYYQEASGTLVGVTGTKVTVAGTNS SQE P I ENGLAKTGVYNI I GS TEVKNEAKI S SQTQFTLEKGDKI 
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NYDQVLTADGYQWISYKSYSGVRRYIPVKKLTTSSEKAKDEATKPTSYPNLPKTGTYTFTKTVDVKSQPKVSSPVEFNFQKGEKIH 
YDQVLWDGHQWI S YKS YSGIRRYI E I 

SEQ ID NO. 5 

ATGAAAAAGAGACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAAATTCCATTTGGTATATTGGTACAAGG 
TGAAACC C AAGATAC C AAT CAAGCACTTGGAAAAGTAATTGTTAAAAAAACGGGAGAC AATGCT ACAC CAT TAGGC AAAG CGACTT 
TTGTGTTAAAAAATGACAATGATAAGTC^GAAACAAGTCACGAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAA 
CCTGGAGACTACACATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAACTGATAAAACCTGGAAAGTTAAAGTTGCAGATAA 
CGGAGCAACAATAATCGAGGGTATGGATGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGCCCAATATCCAAAATCAGCTA 
TTTATGAGGATACAAAAGAA^TTACCC^TTAGTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAATCCAATA 
AATGGAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAAAAAA^TTACAGGGGTCAATGATCTCGATAAGAATAAATA 
TAAAATTGAATTAACTGTTGAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACCACTAGATGTCGTTGTGCTATTAGATA 
ATTCAAATAGTATGAATAATGAAAGAGCCAATAATTCTCAAAGAGC^^ 

ATTACATCAAATAAAGACAATAGAGTAGCTCTTGTGACATATGCCTCAACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGG 
AGTTGCCGATCAAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTATCATAAAACTACTTTTACAGCAACTACACATAATT 
ACAGT TATT T AAAT TTAACAAATGATGCTAACGAAGT TAATAT TCTAAAGT CAAGAATTC C AAAGGAAGCGGAGCATATAAATGGG 
GAT CGCACGCT CTAT C AATT TGGTGCGACATTTACT CAAAAAGCT CT AATGAAAGCAAATGAAATT TTAGAGACAC AAAGT T CTAA 
TG CTAGAAAAAAACTTATTTTT CACGT AACTGATGGTGT C C C TACGATGT CT TATGC CATAAAT TTTAAT CCT T ATAT AT CAAC AT 
CTTAC CAAAAC CAGTTTAATT C TTTTTT AAAT AAAATACCAGAT AGAAGTGGT ATT CT C C AAGAGGAT TTTATAAT CAATGGTGAT 
GATTAT CAAATAGT AAAAGGAGATGGAGAGAGTTTTAAAC TGT T TT CGGATAGAAAAGTT C CTGTTACTGGAGGAACGACACAAGC 
AGCT TAT CGAGTAC CGCAAAATC AACT CT C TGTAATGAGTAATGAGGGATATGCAAT TAAT AGTGGATATATT TATCT CT ATTGGA 
GAGATTAC AAC TGGGT CTAT C C ATTTGAT CCTAAGAC AAAGAAAGTT T CTG C AACGAAAC AAAT C AAAACT CATGGTGAGC C AAC A 
ACATTAT AC TTT AATGGAAAT ATAAGAC CTAAAGGTT ATGAC AT TTTTACTGT TGGGATTGGTGTAAACGGAGAT C CTGGTGCAAC 
T C CT C TTGAAGCTGAGAAAT TTATGC AAT C AAT AT C AAGTAAAACAGAAAATTATACTAATGTTGATGAT ACAAATAAAAT TTATG 
ATGAG CTAAAT AAAT ACTTTAAAACAATTGTTGAGGAAAAACATT CTAT TGTTGATGGAAATGTGACTGATC C TATGGGAGAGATG 
ATTGAATT C CAATTAAAAAATGGT CAAAGTT TTACAC AT GATGATTACGT TT TGGT TGGAAATGATGG CAGT CAATTAAAAAATGG 
TGTGGCTCTTGGTGGACCAAACAGTGATGGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACATCTCAAACCATCAAAA 
TCAATCATTTGAACTTAGGAAGTGGAGAAAAAGTAGTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAAGTAACAA^ 
TAGAATACAAATAATCGTACAACGCTAAGTCCGAAGAGTGAAAAA.GAACCAAATACTATTCGTGATTTCCCAATTCCCAAAATTCG 
TGATGTTCGTGAGTTTCCGGTACTAACCATCAGTAATCAGAAGAAAATGGGTGAGGTTGAATTTATTAAAGTTAATAAAGACAAAC 
ATTCAGAATCGCTTTTGGGAGCTAAGTTTCAACTTCAGATAGAAAAAGATTTTTCTGGGTATAAGCAATTTGTTCCAGAGGGAAGT 
GATGTTACAAC AAAGAATGATGGT AAAATT TATT T TAAAGC ACTT CAAGATGGTAAC TAT AAAT TATATGAAATTTCAAGT C C AGA 
TGGCTATATAGAGGTT AAAACGAAAC CTGTTGTGACAT T TACAAT T CAAAATGGAGAAGTTACGAAC CTGAAAGC AGATC CAAATG 
CTAATAAAAAT C AAAT CGGGTAT CTTGAAGGAAATGGT AAACAT C TTATT ACCAAC ACTC C CAAACGC CCAC CAGGTGTT TTTC C T 
AAAACAGGGGGAATTGGTACAATTGTCTATATATTAGTTGGTTCTACTTTTATGATACTTACCATTTGTTCTTTCCGTCGTAAACA 

ATTG 

SEQ ID NO. 6 

MKKRQKIWRGLSVTLLILSQIPFGILVQ 

PGDYTLREETAPIGYKKTDKTWKVKVADNGATII^ 

NGKDGRREIAEGWLSKKITGVMDLDKNKYKI^ 

I TSNKDNRVALVTYASTI FDGTEATVS KGVADQNGKALNDSVSWDYH PKEAEHING 
DRTLYQFGATFTQKALMKANEI LETQSSNARKKIiI FHVTDGVPTMS YAINFNP YI STS YQNQFNS FIjNKI PDRSGI LQEDFI INGD 
DYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWYPFDPKTKKVS 
TLYFNGNIRPKGYDI FTVGI GVNGDPGATPLEAEKFMQS I SSKTENYTNVDDTNKI YDEIiNKYFKTI VEEKHS IVDGNVTDPMGEM 
I EFQLKNGQSFTHDDYVLVGNDGSQLKNGVALGGPNSDGGI LKDVTVTYDKTSQTI KINHLNLGSGQKVVLTYDVRLKDNYISNKF 
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRD^ 

DVTTKNDGKIYFKALQDGNYKLYE I SSPDGYI EVKTKP WTFTI QNGEVTNLKADPNANKNQI GYLEGNGKHLI TNTPKRPPGVFP 
KTGGIGTIVYILVGSTFMILTICSFRRKQL 

SEQ ID NO. 7 

GTGGATAAACATCACTCAAAAAAGGCTATTTTAAAGTTAACACTTATAACAACTAGTA^ 

CAAGAATTAAAAAACCAAGAGCAATCACCTGTAATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATACTGTTGAAAAAACATCT 
GTAACAGCTGCTTCTGCTAGTAATACAGCGAAAGAAATGGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGAAGAGTTATCT 
AAAAACCTTGATACGTCTAATTTGGGGGCTGATCTTGAAGAAGAATATCCCTCTAAACCAGAGACAACCAACAATAAAGAAAGCAATGTAGTAACA 
AATGCTTCAACTGCAATAGCACAGAAAGTTCCCTCAGCATATGAAGAGGTGAAGCCAGAAAGCAAGTCATCGCTTGCTGTTCTTGATACATCTAAA 

GATAGCCCAAAAGATGATAAGCACAGCTTTAAAACTAAGACAGAATTTGAGGAATT^ 

GATAAGATTGTTTTTGCACATAACTACGCCAACAATACAGAAACGGTGGCTGATATTGCAGCAGCTATGAAAGATGGTTATGGTTCAGAAGCAAAG 
AATATTTCGC!ATGGTACAGACGTTGCTGGTATTTTTGTAGGTAATAGTAA^ 

GCTCAAGTCTTATTAATGCGTATTCCAGATAAAATTGATTCGGACAAATTTGGTGAAGCATATGCTAAAGCAATCACAGACGCTGTTAATCTAGGA 
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GCAAAAACGATTAATATGAGTATTGGAAAAACAGCTGATTCTTTAATTGCTCTCAATGATAAAGTTAA^ 

GGCGTTGCAGTTGTTGTGGCTGCCGGAAATGAAGGCGCATTTGGTATGGATTATAGCAAACCATTATCAACTAATCCTGACTACGGTACGGTTAAT 
AGTCCAGCTATTTCTGAAGATACTTTGAGTGTTGCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGAAACAACTATTGAAGGTAAGTTA 
GTTAAGTTGCCGATTGTGACTTCTAAACCTTTTGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCAAAAAAAGACTTTGAAGGT 
AAGGACTTTAAAGGTAAGATTGCATTAATTGAGCGTGGTGGTGGACTTGATTTTATGACTAAAATCACTCATGCTACAAATGCAGGTGTTGTTGGT 
ATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATTTTCTAATTCCTTACCGTGAATTACCTGTGGGGATTATTAGTAAAGTAGATGGCGAGCGT 
ATAAAAAATAGTT CAAGTCAGTTAAC ATTTAAC CAGAGTTTTGAAGTAGTTGATAGCC AAGGTGGTAATCGTATG CTGGAACAATC AAGTTGGGGC 
GTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTGGCTTTGAAATTTATTCTTCAACCTATAATAATCAATACCAAACAATGTCTGGT 
ACAAGTATGGCTTCACCACATGTTGCAGGATTAATGACAATGCTTCAAAGTCATTTGGCTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAA 
TTGCTAGAATTGTCTAAAAACATCCTCATGAGCTCAGCAACAGCATTATATAGTGAAGAGGATAAGGCGTTTTA 

GGTGTAGTTGATGCTGAAAAAGCTATCCAAGCTCAATATTATATTACTGGAAACGATGGCAAAGCTAAAATTAATCTCAAACGAATGGGAGATAAA 
TTTGATATCACAGTTACAATTCATAAACTTGTAGAAGGTGTCAAAGAATTGTATTATCAAGCTAATGTAGCAACAGAACAAGTAAATAAAGGTAAA 
TTTGCCCTTAAACCACAAGCCTTGCTAGATACTAATTGGCAGAAAGTAATTCTTCGTGATAAAGAAACACAAGTTCGATTTACTATTGATGCTAGT 
CAATTTAGTCAGAAATTAAAAGAACAGATGGCAAATGGTTATTTCTTAGAAGGTTTTGTACGTTTTAAAGAAGCCAAGGATAGTAATCAGGAGTTA 
ATGAGTATTCCTTTTGTAGGATTTAATGGTGATTTTGCGAACTTACAAGCACTTGAAACACCGATTTATAAGACGCTTTCTAAAGGTAGTTTCTAC 
TATAAACCAAATGATACAACTCATAAAGACCAATTGGAGTACAATGAATCAGCTCCTTTTGAAAGCAACAACTATACTGCCTTGTTAACACAATCA 
GCGTCTTGGGGCTATGTTGATTATGTCAAAAATGGTGGGGAGTTAGAATTAGCACCGGAGAGTCCAAAAAGAATTATTTTAGGAACTTTTGAGAAT 
AAGGTTGAGGATAAAACAATTCATCTTTTGGAAAGAGATGCAGCGAATAATCCATATTTTGCCATTTCTCCAAATAAAGATGGAAATAGGGACGAA 
ATCACTCCCCAGGCAACTTTCTTAAGAAATGTTAAGGATATTTCTGCTCAAGTTCTAGATCAAAATGGAAATGTTATTTGGCAAAGTAAGGTTTTA 
CCATCTTATCGTAAAAATTTCCATAATAATCCAAAGCAAAGTGATGGTCATTATCGTATGGATGCTCTTCAGTGGAGTGGTTTAGATAAGGATGGC 
AAAGTTGTAGCAGATGGTTTTTATACTTATCGCTTACGTTACACACCAGTAGCAGAAGGAGCAAATAGTCAGGAGTCAGACTTTAAAGTACAAGTA 
AGTACTAAGTC^CCAAATCTTCCTTCACGAGCTC^GTTTGATGAAACTAATCGAACATTAAGCTTAGCCATGCCTAAGGAAAGTAGTTATGTTCCT 
ACATATCGTTTACAATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGGGATGAGACTTCTTACCATTATTTCCATATAGATCAAGAAGGT 
AAAGTGACACTTCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCGGTAGACCCTAAGGCCTTGACACTTGTTGTGGAAGATAAAGCTGGTAAT 
TT CGCAACGGTAAAATTGT CTGATCT CTTGAATAAGGCAGTAGTATC AG AGAAAGAAAACGCTATAGTAATTT CTAAC AGTTT C AAATATTTTGAT 
AACTTGAAAAAAGAACCTATGTTTATTTCTAAAAAAGAAAAAGTAGTAAACAAGAATCTAGAAGAAATAATATTAGTTAAGCCGCAAACTACAGTT 
ACTACTCAATCATTGTCTAAAGAAATAACTAAATCAGGAAATGAGAAAGTCCTCACTTCTACAAACAATAATAGTAGCAGAGTAGCTAAGATCATA 
TCACCTAAACATAACGGGGATTCTGTTAACCATACCTTACCTAGTACATCAGATAGAGCAACGAATGGTCTATTTGTTGGTACTTTGGCATTGTTA 
T CTAGTTTACTT CTTTATTTGAAAC CC AAAAAGAGTAAAAATAATAGTAAA 

SEQ ID NO. 8 

VDKHHSKKAILKLTIjITTSILLMHSNQVNAEEQEL 

KNLDTSNLGADLEEEYPSKPETTNNKESNVVTN^ 

DSPKDDKHSFKTKTEFEELK7UCHNIT 

AQVLLMRIPDKIDSDKFGEAYAKAITDAWLGAKTINMSIGKTADSLIALNDKVKLAIiKLASEKGVAVW 

S PAI SEDTLSVAS YESLKT I SE WETTI EGKLVKLP I VTS KPFDKGKAYDVVYANYGAKKDFEGKDFKGKI AhX ERGGGLDFMTKITHATNAGWG 
IVIFNDQEKRGNFLIPYRELPVGIISKVDGER^ 

TS^^SPHVAGLMTMLQSHIlAEKYKGMNLDS KKIjIjELSKNI LMS SATALYSEEDKAFYS PRQQGAGWDAEKAI QAQYYITGNDGKAKINLKRMGDK 

FDITVTIHKLVEGVKELYYQANVATEQVNKGKFALK^ 

MSIPFVGFNGDFANLQALETPIYKTnSKGSFYYKPNDTTHIODQL^ 

KVEDKTIHLIjERDAAlSrNPYFAISPNKDGNRDEITPQATFLRlTVKDISAQVLDQNGNVIW 

KVVADGFYTYRLRYTPVAEGANSQESDFK^QVSTKSPNLPSRAQFDETNRTLSLAMPKESSYVPTYRLQLVL 

KVTLPKTVKIGESEVAVDPKALTLVVEDKAGNFAT^^ 

TTQSLSKEITKSGNEKTVJjTSTNNNSSRVA 

SEQ ID NO. 9 

ATGAAACGTAAATACTTTATTCTTAATACGGTGACGGTTTTAACGTTAGCTGCTGCAATGAATACTAGCAGTATCTATGCTAATAGTACTGAGACA 
AGTGCTTCAGTAGTTCCTACTACAAATACTATCGTTCAAACTAATGACAGTAATCCTACCGCAAAATTTGTATCAGAATCAGGACAATCTGTAATA 
GGTCAAGTAAAACCAGATAATTCTGCGGCGCTTACAACAGTTGACACGCCTCATCATATTTCAGCTCCAGATGCTTTAAAAACAACTCAATCAAGT 
CCTGTCGTTGAGAGTACTTCTACTAAGTTAACTGAAGAGACTTACAAACAAAAAGATGGTCAAGATTTAGCCAACATGGTGAGAAGTGGTCAAGTT 
ACTAGTGAGGAACTCGTTAATATGGCATACGATATTATTGCTAAAGAAAACCCATCTTTAAATGCAGTCATTACTACTAGACGCCAAGAAGCTATT 
GAAGAGGCTAGAAAACTTAAAGATACCAATCAGCCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGGTTAGGGCACAGTATTAAAGGTGGTGAAACC 
AATAATGGCTTGATCTATGCAGATGGAAAAATTAGCACATTTGACAGTAGCTATGTCAAAAAATATAAAGATTTAGGATTTATTATTTTAGGACAA 
ACGAACTTTCCAGAGTATGGGTGGCGTAATATAACAGATTCTAAATTATACGGTCTAACGCATAATCCTTGGGATCTTGCTCATAATGCTGGTGGC 
TCTTCTGGTGGAAGTGCAGCAGCCATTGCTAGCGGAATGACGCCAATTGCTAGCGGTAGTGATGCTGGTGGTTCTATCCGTATTCCATCTTCTTGG 
ACGGGCTTGGTAGGTTTAAAACCAACAAGAGGATTGGTGAGTAATGAAAAGCCAGATTCGTATAGTACAGCAGTTCATTTTCCATTAACTAAGTCA 
TCTAGAGACGC AGAAACATTATTAAC TT ATCTAAAGAAAAGCG ATCAAACG CTAGTATCAGTTAATGATTTAAAATCTTTAC CAATTGCTTATACT 
TTGAAATCAC CAATGGG AACAGAAGT T AGTCAAGATGCTAAAAACGCTATTATGGACAACGTC ACATTCTTAAGAAAACAAGGATT CAAAGTAACA 
GAGATAGACTTACCAATTGATGGTAGAGCATTAATGCGTGATTATTCAACCTTGGCTATTGGCATGGGAGGAGCTTTTTCAACAATTGAAAAAGAC 
TTAAAAAAACATGGTTTTACTAAAGAAGACGTTGATCCTATTACTTGGGCAGTTCATGTTATTTATCAAAATTCAGATAAGGCTGAACTTAAGAAA 
T CTATTATGGAAGCC C AAAAACATATGG ATG ATTATCGTAAGGCAATGGAGAAGCTTCAC AAG CAATTTCCTATTTTCTTATCGCCAACGAC CGCA 
AGTTTAGCCCCTCTAAATACAGATCCATATGTAACAGAGGAAGATAAAAGAGCGATTTATAATATGGAAAACTTGAGCCAAGAAGAAAGAATTGCT 
CTCTTTAATCGCCAGTGGGAGCCTATGTTGCGTAGAACACCTTTTACACAAATTGCTAATATGACAGGACTCCCAGCTATCAGTATCCCGACTTAC 
TTATCTGAGTCTGGTTTACCCATAGGGACGATGTTAATGGCAGGTGCAAACTATGATATGGTATTAATTAAATTTGCAACTTTCTTTGAAAAACAT 
CATGGTTTTAATGTTAAATGGCAAAGAATAATAGATAAAGAAGTGAAACCATCTACTGGCCTAATACAGCCTACTAACTCCCTCTTTAAAGCTCAT 
TCATCATTAGTAAATTTAGAAGAAAATTCACAAGTTACTCAAGTATCTATCTCTAAAAAATGGATGAAATCGTCTGTTAAAAATAAACCATCCGTA 
ATGGCATATCAAAAAGCACTTCCTAAAACAGGTGATACAGAATCAAGCCTATCTCCAGTTTTAGTAGTAACCCTTTTATTAGCTTGTTTTAGCTTT 
GTAACAAAAAAGAATCAGAAAAGT 
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SEQ ID NO. 10 

MKRKYFILNTVTVLTLAAAMNTSSIYANS 

PVVESTSTKLTEETYKQKDGQDIiANMVRSGQW 

NNGLIYADGKISTFDSSYVKXYKDLGFIILGQTNF 

TGLVGLKPTRGLVSNEKPDSYSTAWFPLTKSSRDAETLLT 

EIDLPIDGRALMRDYSTIaAIGMGGAFSTIEKDLKKHGFTKE 

SLAPLNTDPYVTEEDKRAIYNMENLSQEERIALFNRQW 

HGFNVKWQRIIDKEVKPSTGLIQPTNSLFKAHSSLWL^ 

VTKKNQKS 

SEQ ID NO. 11 

TTGCGTAAAAAACAAAAACTACCATTTGATAAACTTGCCATTGCGCTT^^ 

ACTGTGACAGAAGACACTCCTGCTACCGAACAAGCCGTAGAACCCCCACAACCAATAGCAGTTTCTGAGGAATCACGATCATCAAAGGAAACTA 
ACCTCACAAACTCCTAGTGATGTAGGAGAAAC^^ 

ACCTCAAAAGCGACTATTAGGGATTTGAACGACCCTTCTCATGTCAAAACCCTGCAGGAAAAAGCAGGCAAGGGAGCTGGGACCGTTGTTGCAGTG 
ATTGATGCTGGTTTTGATAAAAATCATGAAGCGTGGCGCTTAACAGACAAAAC^ 

AAAGAGCACGGTATTACCTATGGCGAGTGGGTCAATGATAAGGTTGCTTATTACCACGACTATAGTAAAGATGGTAAAAACGCTGTTGATCAAGAA 
CACGGCACACACGTGTCAGGGATCTTGTCAGGAAATGCTCCATCTGAAATGAAAGAACCTTACCGCCTAGAAGGTGCGATGCCTGAGGCTCAATTG 
CTTTTGATGCGTGTCGAAATTGTAAATGGACTAGCAGACTATGCTCGTAACTACGCTCAAGCTATCAGAGATGCTGTCAACTTGGGAGCTAAGGTG 
ATTAATATGAGCTTTGGTAATGCTGC^CTAGCTTACGCC^CCTTCCAGACGAAACCAAAAAAGCCTTTGACTATGCCAAATCAAAAGGTGTTAGC 
ATTGTGACCTCAGCTGGTAATGATAGTAGCTTTGGGGGCAAGCCCCGTCTACCTCTAGCAGATCATCCTGATTATG.GGGTGGTTGGGACACCTGCA 
GCGGC^GATTCAACATTGACAGTTGCTTCTTACAGCCCAGATAAAC^GCTCACTGAAACTGCTACGGTCAAAACAGACGATCATCAAGATAAAGAA 
ATGCCTGTTATTTCAACAAACCGTTTTGAGCCAAACAAGGCTTACGACTATGCTTATGCTAATCGTGGTACGAAAGAGGATGATTTTAAGGATGTC 
GAAGGTAAGATTGCCCTTATTGAACGTGGCGATATTGATTTCAAAGATAAGATTGCAAACGCTAAAAAAGCTGGTGCTGTAGGGGTCTTGATCTAT 
GACAATCAAGACAAGGGCTTCCCGATTGAATTGCCAAATGTTGACCAGATGCCTGCGGCCTTTATCAGTCGAAGAGACGGTCTCTTATTAAAAGAC 
AATCCCCCAAAAACCATTACCTTCAATGCG 

GCTGACGGCAATATTAAACCGGATATTGCAGCACCCGGCCAAGATATTTTGTCATCAGTGGCTAACAACAAGTATGCCAAACTTTCTGGAACT 
ATGTCTG CACC ATTGGT AGCGGGTAT C ATGGGACTGTTGCAAAAG CAATATGAGACACAGTATCCTGAT ATGACAC C ATCAGAGCGTCTTGATTTA 
GCTAAGAAAGTATTGATGAGCTCAGCAACTGCCCTATATGATGAAGATGAAAAAGCTTATTTTTCTCCTCGCCAACAGGGAGCAGGAGCAGTCGAT 
GCTAAAAAAGCTTCAGCAGCAACGATGTAT 

ACAGTAACAGTTCACAACAAAT CTGATAAACCTC AAG AGTTGTATTACCAAGTAACTGTT CAAAC AGATAAAGTAGATGGAAAACACTTTGC CTTG 
GCTCCTAAAGCATTGTATGAGACATCATGGCAAAAAATCACAATTCCAGCCAATAGCAGCAAACAAGTCACCGTTCCAATCGATGCTAGTCGATTT 
AGCAAGGACTTGCTTGCCCAAATGAAAAATGGCTATTTCTTAGAAGGTTTTGTTCGTTTCAAACAAGATCCTACAAAAGAAGAGCTTATGAGCATT 
CC ATATATTGGTTTCCGAGGTGATTTTGGC AAT CTGT C AGC CTT AGAAAAACCAATCTATGATAGCAAAGACGGTAGCAGCT ACTATCATGAAGCA 
AATAGTGATGCCAAAGACCAATTAGATGGTGATGGATTACAGTTTTACGCTCTGAAAA 

ACGATTATTAAAGCTGTCAAAGAAGGGGTTGAAAACATAGAGGATATCGAATCTTCAGAGATCACAGAAACCATTTTTGCAGGTACTTTTGCAAAA 

CAAGACGATGATAGCCACTACTATATCCACCGTCACGCTAATGGCAAACCATATGCTGCGATCTCTCCAAATGGGGACGGTAACAGAGATTATGTC 

CAATTCCAAGGTACTTTCTTGCGTAATGCTAAAAACCTTGTGGCTGAAGTCTTGGACAAAGAAGGAAATGTTGTTTGGACAAGTGAGGTAACCGAG 

CAAGTTGTTAAAAACTACAACAATGACTTGGCAAGCACACTTGGTTCAACCCGTTTTGAAAAAACGCGTTGGGACGGTAAAGATAAAGACGGCAAA 

GTTGTTGCTAACGGAACCTACACCTATCGTGTTCGCTACACGCCGATTAGCTCAGGTGCAAAAGAACAACACACTGATTTTGATGTGATTGTAGAC 

AATACG ACACC TGAAGT CGC AACAT CGGCAACATTC TG AACAGAAGATAGTCGTTTG ACACTTGC ATCTAAACCAAAAAC CAGCC AACCGGTTTAC 

CGTGAGCGTATTGCTTACACTTATATGGATGAGGATCTGCCAACAACAGAGTATATTTCTCCAAATGAAGATGGTACCTTTACTCTTCCTGAAGAG 

GCTGAAACAATGGAAGGCGCTACTGTTCCATTGAAAATGTCAGAGTTTACTTATGTTGTTGAAGATATGGCTGGTAACATCACTTATACACCAGTG 

ACTAAGCTATTGGAGGGCCACTCTAATAAGCCAGAACAAGACGGTTCAGATCAAGGACCA 

TCAGGTCAAACACCAGATAAAAAAAAAGAAACTAAACCAGAAAAAGATAGTTCAGGTCAAACACCAGGTAAAA 

CGTACTCTAGAGAAACGATGTTCTAAGCGTGCTTTAGCTAGAAAAGCATCAACAAGAGATCAGTTACCAACGACTAATGACAAGGATACAAATCGT 
TTACATCTCCTTAAGTTAGTTATGACCACTTTCTTCTTGGGA 

SEQ ID NO. 12 

MRKKQKLPFDKLAIALISTSILLNAQ 

APAKTADTPATSKATIRDIiNDPSHVKTLQEKAGKGAGTWAVIDAGFDKNHEAWRLTDKTK^ 

DKVAYYHDYSKEGKIJAVDQEHGTHVSGILSGNAPSE 

MSFGNAAI»AYANLPDETKKAFDYAKSKGVSIVTSAG^ 

TDDHQDKEMPVI S TNRFEPNKAYD YAYANRGTKEDDFKDVEGKI AL I ERGD I DFKDKI ANAKKAGAVGVL I YDNQDKGFP I ELPNV 

DQMPAAFISRRDGLLLKDNPPKTITFNATPK^^ 

LVAGIMGLLQKQYETQYPDMTPSERLDLAKKV^^ 

SDKFEVOTTVHNKSDKPQELYYQVTVQTDKVm 

FTOFKQDPTKEELMSIPYIGFRGDFGNLSALEKPIYT)SKDGSSYra 

KEGVENIEDIESSEITETIFAGTFAKQDDDSHYYIHRHANGK^ 

EVTEQVVKNYl^IASTLGSTRFEKTRW 

LTI^KPKTSQPVYRERIAYTYMDEDLPTTEYISPNEDGTF 

NKPEQDGSDQAPDKKPEAKPEQDGSGQTPDKKXETKPEKDSSGQTPGKTPQKGQSSRTLEKRSSKRALATKA^ 
NRLHLLKLVMTTFFLG 
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SEQ ID NO. 13 

ATGGGACGAGTAATGAAAACAATAACAACATTTGAAAATAAAAAAGTTTTAGTCCTTGGTTTAGCACGATCTGGAGAAGCTGCTGC 
ACGTTTGTTAGCTAAGTTAGGAGCAATAGTGACAGTTAATGATGGCAAACCATTTGATGAAAATCCAACAGCACAGTCTTTGTTGG 
AAGAGGGTATTAAAGTGGTTTGTGGTAGTCATCCTTTAGAATTGTTAGATGAGGATTTTTGTTACATGATTAAAAATCCAGGAATA 
CCTTATAACAATCCTATGGTCAAAAAAGCATTAGAAAAACAAATCCCTGTTTTGACTGAAGTGGAATTAGCATACTTAGTTTCAGA 
ATCTCAGCTAATAGGTATTACAGGCTCTAACGGGAAAACGACAACGACAACGATGATTGCAGAAGTCTTAAATGCTGGAGGTCAGA 
GAGGTTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTGAAGTTGTTCAGGCTGCGAATGATAAAGATACTCTAGTTATGGAATTA 
TCAAGTTTTCAGCTAATGGGAGTTAAGGAATTTCGTCCTCATATTGCAGTAATTACTAATTTAATGCCAACTCATTTAGATTATCA 
TGGGTCTTTTGAAGATTATGTTGCTGCAAAATGGAATATCCAAAATCAAATGTCTTCATCTGATTTTTTGGTACTTAATTTTAATC 
AAGGTATTTCTAAAGAGTTAGCTAAAACTACTAAAGCAACAATCGTTCCTTTCTCTACTACGGAAAAAGTTGATGGTGCTTACGTA 
CAAGACAAGCAACTTTTCTATAAAGGGGAGAATATTATGTCAGTAGATGACATTGGTGTCCCAGGAAGCCATAACGTAGAGAATGC 
TCTAGCAACTATTGCGGTTGCTAAACTGGCTGGTATCAGTAATCAAGTTATTAGAGAAACTTTAAGCAATTTTGGAGGTGTTAAAC 
ACCGGTTGCAATCACTCGGTAAGGTTCATGGTATTAGTTTCTATAACGACAGCAAGTCAACTAATATATTGGCAACTCAAAAAGCA 
TTATCTGGCTTTGATAATACTAAAGTTATCCTAATTGCAGGAGGTCTTGATCGCGGTAATGAGTTTGATGAATTGATACCAGATAT 
CACTGGACTTAAACATATGGTTGTTTTAGGGGAATCGGCATCTCGAGTAAAACGTGCTGCACAAAAAGCAGGAGTAACTTATAGCG 
ATGCTTTAGATGTTAGAGATGCGGTACATAAAGCTTATGAGGTGGCACAACAGGGCGATGTTATCTTGCTAAGTCCTGCAAATGCA 
TCATGGGACATGTATAAGAATTTCGAAGTCCGTGGTGATGAATTCATTGATACTTTCGAAAGTCTTAGAGGAGAG 

SEQ ID NO. 14 

MGRVMKTITTFENKKVLVLGLARSGEAAARLLAKIjGAIVTVNDGKPFDEN 
PYNNPMVKKALEKQIPVLTEVELAYLVSESQLIGI^ 
SSFQLMGVKEFRPHIAVITNLMPTHLDYHGSFEDYVAAK^^ 
QDKQLFYKGENIMSVDDIGVPGSHNVENALATIAV^ 

LS GFDNTKVI L I AGGLDRGNE FDEL I PD I TGLKHMWLGE S ASRVKRAAQKAGVT YSDALDVRDAVHKAYE VAQQGDVI LLS PANA 
S WDMYKNFE VRGDE F I DTFE S LRGE 

SEQ ID NO. 15 

ATGAAACGTATTGCTGTTTTAACTAGTGGTGGTGACGCCCCTGGTATGAACGCTGCTATCCGTGCAGTTGTTCGTAAAGCAATTTCTGAAGGTATG 
GAAGTTTACGGCATCAACCAAGGTTACTATGGTATGGTGACAGGGGATATTTTCCCTTTGGATGCTAATTCTGTTGGGGATACTATCAACCGTGGA 
GGAACGTTTTTACGTTCAGCACGTTATCCTGAATTTGCTGAACTTGAAGGTCAGCTTAAAGGGATTGAACAGCTTAAAAAACACGGTATTGAAGGT 
GTAGTAGTTATCGGTGGTGATGGTTCTTATCATGGTGCTATGCGTCTAACTGAGCACGGTTTCCCAGCTGTTGGTTTGCCGGGTACAATTGATAAC 
GATATCGTTGGCACTGACTATACTATTGGTTTTGACACAGCAGTTGCGACAGCAGTTGAGAATCTTGACCGTCTTCGTGATACATCAGCAAGTCAT 
AACCGTACTTTTGTTGTTGAGGTTATGGGAAGAAATGCAGGAGATATCGCTCTTTGGTCAGGTATCGCTGCAGGTGCAGATCAAATTATTGTTCCT 
GAAGAAGAGTTCAATATTGATGAAGTTGTCTCAAATGTTAGAGCTGGCTATGCAGCTGGTAAACATCACCAAATCATCGTCCTTGCAGAAGGTGTT 
ATGAGTGGTGATGAGTTTGCAAAAACAATGAAAGCAGCAGGAGACGATAGCGATCTTCGTGTGACGAATTTAGGACATCTGCTCCGTGGTGGTAGT 
CCGACGGCTCGTGATCGTGTCTTAGCATCTCGTATGGGAGCGTACGCTGTTCAATTGTTGAAAGAAGGTCGTGGTGGTTTAGCCGTTGGTGTCCAC 
AACGAAGAAATGGTTGAAAGTCCAATTTTAGGTTTAGCAGAAGAAGGTGCTTTGTTCAGCTTGACTGATGAAGGAAAAATCGTTGTTAATAATCCG 
CATAAAGCGGACCTTCGCTTGGCAGCACTTAATCGTGACCTTGCCAACCAAAGTAGTAAA. 

SEQ ID NO. 16 

MKRIAVLTSGGDAPGMNAAIRAVVRKAISEGMEVYGI 

VWIGGDGS YHGAMRLTEHGFPAVGLPGT IDND I VGTDYT IGFDTAVATAVENLDRLRDTS ASHNRTF WEVMGRNAGDI ALWSGI AAGADQI I VP 
EEEFNIDE VVSNVRAG YAAGKHHQ 1 I VXAEG VMSGDE FAKTMKAAGDDSDLRVTNLGHLLRGG S PTARDRVIiASRMGA YAVQIiLKEGRGGLAVG VH 
NEEMVES P ILGLAEEGALFSLTDEGKI VVmPHKADIiRIiAALNRDIiANQSSK 

SEQ ID NO. 17 

ATGAATAAAAAGGTACTATTGAC^TCGACAATGGC^GCTTCGCTATTATCAGTCGCAAGTGTTC^GCACAAGAAACAGATACGACGTGGACAGCA 
CGTACTGTTTCAGAGGTAAAGGCTGATTTGGTAAAGCAAGACAATA7^ATCATCATATACTGTGAAATATGGTGATACACTAAGCGTTATTTCAGAA 
GCAATGTCAATTGATATGAATGTCTTAGCAAAAA^ 

CAGAAGAGTCATACTGCCACTTCAATGAAAATAGAAACACCAGCAACAAATGCTGCT 

GTTTCTGTTGCAGACCAAAAAGTTTCTCTCAATACAATTTCGGAAGGTATGACACCAGAAGCAGCAACAACGATT 

TCTTCTGCGCCAGCTTTGAAATCAAAAGAAGTATTAGCACAAGAGCAAGCTGTTAGTCAAGCAGCAGCTAATGAACAGGTATCACCAGCTCCTGTG 
AAGTCGATTACTTCAGAAGTTCCAGCAGCTAAAGAGGAAGTT^^ 

GCCGCTGAAACACCAGCTCCAGTAGCTAAAGTAGCACCGGTAAGAACTGTAGCAGCCCCTAGAGTGGCAAGTGTTAAAGTAGTCACTCCTAAAGTA 
GAAACTGGTGC^TCACCAGAGCATGTATCAGCTCCAGCAGTTCCTGTGACTACGACTTC^CC^GCTACAGACAGTAAGTTACAAGCGACTGAAGTT 
AAGAGCGTTCCGGTAGCACAAAAAGCTCCAACAGCAACACCGGTAGCACAAGCAGCTT^^ 
GGGCTCCAACCTC&TGTTGCAGCTTATAAAGAAA^GTA 

CATGGTAAAGGTTTAGCAGTTGACTTTATTGTAGGTACTAATCAAGCACTTGGTAATAAAGTTGCACAGTACTCTACACAAAATATGGCAGCAAAT 

AACATTTCATATGTTATCTGGCAACAAAAGTTTTACTCAAA^^ 

XSGCGTTACTGCCAACCACTATGACCACGTTCACGTATCATTT^ 

AGACTTTCAAGGTTGTTATATAATTTTTATTA 
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SEQ ID NO. 18 

I^KKVXiLTSTMAASLLSVASVQAQETO 

QKS HT ATSMKI ET PATNAAGQTTATVDIjKTNQ VS VAD QKVS LNT I S EGMT P E AATT I VS PMKT YS S AP ALKS KE VXiAQEQAVS QAAANEQVS PAP V 
KS I TSEVPAAKEEVKPTQTS VSQSTTVS PAS VAAETPAPVAKTVAPVRTVAAPRVAS VKVVTPKVETGAS PEHVSAPAVPVTTTS PATDSKIJQATEV 
KS VP VAQKAPTAT P VAQP AS TTNAVAAHPENAGLQPHVAAYKE KVAS T YGVNE F S T YRAGD PGDHGKGIiAVDF I VGTNQALGNKVAQ YS TQNMAAN 
NI S YVI WQQKFYSNTNS I YGPANTWNAMPDRGGVTANHYDHVHVS FNK 

SEQ ID NO. 19 

ATGAAAAAGAAAATTATTTTGAAAAGTAGTGTTCTTGGTTTAGTCGCTGGGACTTCTATTATGTTCTCAAGCGTGTTCGCGGACCAAGTCGGTGTC 
CAAGTTATAGG CGTCAATGACTTTC ATGGTGCACTTGACAATACTGGAAC AGCAAATATGCCTG ATGGAAAAGTTG CTAATGCTGGTACTG CTGCT 
CAATTAGATGCTTATATGGATGACGCTCAAAAAGATTTCAAACAAACTAACCCTAATGGTGAAAGCATTAGGGTTCAAGCAGGCGATATGGTTGGA 
GCAAGTCCAGCCAACTCTGGGCTTCTTCAAGATGAACCAACTGTCAAAAATTTTAATGCAATGAATGTTGAGTATGGCACATTGGGTAACCATGAA 
TTTGATGAAGGGTTGGCAGAATATAATCGTATCGTTACTGGTAAAGCCCCTGCTCCAGATTCTAATATTAATAATATTACGAAATCATACCCACAT 
GAAGCTGCAAAACAAGAAATTGTAGTGGCAAATGTTATTGATAAAGTTAACAAACAAATTCCTTACAATTGGAAGCCTTACGCTATTAAAAATATT 
CCTGTAAATAACAAAAGTGTGAACGTTGGCTTTATCGGGATTGTCACCAAAGACATCCCAAACCTTGTCTTACGTAAAAATTATGAACAATATGAA 
TTTTTAGATGAAGCTGAAACAATCGTTAAATACGCCAAAGAATTACAAGCTAAAAATGTCAAAGCTATTGTAGTTCTCGCACATGTACCTGCAACA 
AGTAAAAATGATATTGCTGAAGGTGAAGCAGCAGAAATGATGAAAAAAGTCAATCAACTCTTCCCTGAAAATAGCGTAGATATTGTCTTTGCTGGA 
CACAATCATCAATATACAAATGGTCTTGTTGGTAAAACTCGTATTGTACAAGCGCTCTCTCAAGGAAAAGCCTATGCTGATGTACGTGGTGTCTTA 
GATACTGATACACAAGATTTCATTGAGACCCCTTCAGCTAAAGTAATTGCAGTTGCTCCTGGTAAAAAAACAGGTAGTGCCGATATTCAAGCCATT 
GTTGACCAAGCTAATACTATCGTTAAACAAGTAACAGAAGCTAAAATTGGTACTGCCGAGGTAAGTGTCATGATTACGCGTTCTGTTGATCAAGAT 
AATGTTAGTCCGGTAGGCAGCCTCATCACAGAGGCTCAACTAGCAATTGCTCGAAAAAGCTGGCCAGATATCGATTTTGCCATGACAAATAATGGT 
GGCATTCGTGCTGACTTACTCATCAAACCAGATGGAACAATCACCTGGGGAGCTGCACAAGCAGTTCAACCTTTTGGTAATATCTTACAAGTCGTC 
GAAATTACTGGTAGAGATCTTTATAAAGCACTCAACGAACAATACGACCAAAAACAAAATTTCTTCCTTCAAATAGCTGGTCTGCGATACACTTAC 
ACAGATAATAAAGAGGGCGGGGAAGAAACACCATTTAAAGTTGTAAAAGCTTATAAATCAAATGGTGAGGAAATCAATCCTGATGCAAAATACAAA 
TTAGTTATCAATGACTTTTTATTCGGTGGTGGTGATGGCTTTGCAAGCTTCAGAAATGCCAAACTTCTAGGAGCCATTAACCCCGATACAGAGGTA 
TTTATGGCCTATATCACTGATTTAGAAAAAGCTGGTAAAAAAGTGAGCGTTCCAAATAATAAACCTAAAATCTATGTCACTATGAAGATGGTTAAT 
GAAACTATTACACAAAATGATGGTAGACATAGCATTATTAAGAAACTTTATTTAGATCGACAAGGAAAT 

ACTTTAAAC CAAACAAAATCAAAATCTACAAAAATCAAC CCTGTAACTACAATT CACAAAAAACAATTACACCAATTTACAGCTATTAACCCTATG 
AGAAATTATGGCAAACCATCAAACTCCACTACTGTAAAATCAAAACAATTACCAAAAACAAACTCTGAATATGGACAATCATTCCTTATGTCTGTC 
TTTGGTGTTGGACTTATAGGAATTGCTTTAAATACAAAGAAAAAACATATGAAA 

SEQ ID NO. 20 

MKKKI I LKS S VLGLVAGTS IMFS S VFADQVGVQVIGVNDFHGALDNTGTANMPDGKVANAGTAAQLDAYMDDAQKDFKQTNPNGES IRVQAGDMVG 

AS PANSGLLQDE PTVKNFNAMNVEYGTLGNHEFDEGLAE YNRI VTGKAPAPDSNINNITKSYPHEAAKQEIWANVIDKVNKQIPYNWKPYAIKNI 

PVJ^KS VNVGFIGI VTKDI PNLVLRKNYEQY 

HNHQYTNGLVGKTRIVQALSQGKAYADVRGVL^ 

NVSPVGSLITE^QIAIARKSWPDIDFAMTNNGGIRADLLI^ 

TDNKEGGEETPFKWKAYKSNGEEINPDAKYKLVIITOFL^ 

ETITQNDGTHSIIKICLYLDRQGNIVAQEIVSDT 

FGVGLIGIAIiNTKKKHMK 

SEQ ID NO. 21 

ATGAATAAACGCGTAAAAATCGTTGCAACACTTGGTCCTGCGGTTGAATTCCGTGGTGGTAAGAAGTTTGGTGAGTCTGGATACTGGGGTGAAAGC 
CTTGACGTAGAAGCTTCAGCAGAAAAAATTGCTCAATTGATTAAAGAAGGTGCTAACGTTTTCCGTTTCAACTTCTCACATGGAGATCATGCTGAG 
CAAGGAGCTCGTATGGCTACTGTTCGTAAAGCAGAAGAGATTGCAGGACAAAAAGTTGGCTTCCTCCTTGATACTAAAGGACCTGAAATTCGTACA 
GAACTTTTTGAAGATGGTGCAGATTTCCATTCATATACAACAGGTACAAAATTACGTGTTGCTACTAAGCAAGGTATCAAATCAACTCCAGAAGTG 
ATTGCATTGAATGTTGCTGGTGGACTTGACATCTTTGATGACGTTGAAGTTGGTAAGCAAATCCTTGTTGATGATGGTAAACTAGGTCTTACTGTG 
TTTGCAAAAGATAAAGACACTCGTGAATTTGAAGTAGTTGTTGAGAATGATGGCCTTATTGGTAAACAAAAAGGTGTAAACATCCCTTATACTAAA 
ATTCCTTTCCCAGCACTTGCAGAACGCGATAATGCTGATATCCGTTTTGGACTTGAGCAAGGACTTAACTTTATTGCTATCTCATTTGTACGTACT 
GCTAAAGATGTTAATGAAGTTCGTGCTATT^^ 

AAT ATTGATGAGATTAT CGAAGCAGCAGATGGTATTATGATTGCT CGTGGTGATATGGGTATCGAAGTT C C ATTTGAAATGGTT CCAGTTTACC AA 
AAAATGATCATTACTAAAGTTAATGCAGCTGGTAAAGCAGTTATTACAGCAACAAATATGCTTGAAACAATGACTGATAAACCACGTGCGACTCGT 
TCAGAAGTATCTGATGTCTTCAATGCTGTTATTGATGGTACTGATGCTACAATGCTTTCAGGTGAGTCAGCTAATGGTAAATACCCAGTTGAGTCA 
GTTCGTACAATGGCTACTATTGATAAAAATGCTCAAACATTACTCAATGAGTATGGTCGCTTAGACTCATCTGCATTCCCACGTAATAACAAAACT 
GATGTTATTGCATCTGCGGTTAAAGATGCAACAC^CTCAATGGATATCAAACTTGTTGTAACAATTACTGAAACAGGTAATAdAGCTCGTGCa\TT 
TCTAAATTCCGTCCAGATGCAGACATTTTGGCTGTTACATTTGATGAAAAAGTACAACGTTCATTGATGATTAACTGGGGTGTTATCCCTGTCCTT 
GCAG ACAAACCAG CAT CTACAGATGATATGTTTGAGGTTGCAGAACGTGTAGC ACTTGAAG CAGGATTTGTTGAATCAGGCGATAATATCGTTAT C 
GTTGCAGGTGTTCCTGTAGGTACAGGTGGAACTAACACAATGCGTGTTCGTACTGTTAAA 

SEQ ID NO. 22 

MNKRVKIVATLGPAVEFRGGKKFGESGYWGESLDVEASAEK 
ELFEDGADFHSYTTGTKLRVATKQGIKSTPEVIAIjWAGGI^ 
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IPFPALAERDNADIRFGLEQGLNFIAISFVRTAKDVNEVRAI^ 

KM 1 1 TKVNAAGKAVI TATNMLE TMTDKPRATRS E VSD VFNAVI DGTDATMLS GE S ANGKYP VE S VRTMAT I DKNAQTDLNE YGRLDSS AF PRNNKT 
DVI ASAVKDATHSMD IKLWT ITETGNTARAI SKFRPDADI IiAVTFDEKVQRSLMINWGVI P VLADKPASTDDMFEVAERVALEAGFVESGDNI VI 
VAGVPVGTGGTNTMRVRTVK 

SEQ ID NO. 23 

T TGT CTG CTATA&TAGACAAAAAGGTGGTGATATTTATGTATTTAGC ATT AAT CGGTGATAT CAT TAATT C AAAAC AGATACTTGA 
ACGTGAAACTTTCCAACAGTCTTTTCAGCAACTAATGACCGAACTATCTGATGTATATGGTGAAGAGCTGATTTCTCCATTCACTA 
TTACAGCTGGTGATGAATTTCAAGCTTTATTGAAACCATCAAAAAAGGTATTTGAAATTATTG 

CCTGTTAATGTAAGGTTCGGCCTCGGTACAGGAAACATTATAACATCCATCAATTCAAATGAAAGTATCGGTGCTGATGGTCCTGC 
CTAC TGGCATGCTCGCT CAGCTAT TAAT CATATACATGATAAAAATGATTATGGAACAGT T CAAGTAG CTATTTGC C TTGATGATG 
AAGAC C AAAACCTTGAATTAACAC TAAATAGTC T CATTT CAGC TGGTGAT T TTAT C AAGT C AAAATGGACTAC AAAC CATTTTCAA 
ATGCT TGAGCACTTAATAC T T CAAGATAAT TAT CAAGAACAAT TT C AAC ATCAAAAGT TAGC C C AACTGGAAAATATTGAAC CTAG 
TGCGCTGACTAAACGCGTTAAAGCAAGCGGTCTGAAGATTTACTTAAGAACGAGAACACAGGCAGCCGATCTATTAGTTAAAAGTT 
GCA.CTCAAACTAAAGGGGGAAGCTATGATTTC 

SEQ ED NO. 24 

MSAIIDKKWIFMYLALIGDIINSKQILERETFQQSFQQL^^ 

NI ITS INSNES IGADGPAYWHARS AINHIHDKND YGTVQVAI CLDDEDQNLELTIiNS LI S AGDFI KS KWTTNHFQMLEHLIIiQDNYQEQFQHQKIiAQ 
LENIEPSALTKRLKASGLKIYLRTRTQAADLLVKSCTQTKGGSYDF 

SEQ ID NO. 25 

ATGTTTTATACAATTGAAGAGCTGGTAGAGCAAGCTAATAGCCAACATAAGGGTAACATAGCAGAGCTCATGATCCAAACGGAAATTGAAATGACT 
GGTAGAAGTCGTGAAGAAATT CGTTATATTATGTCCCGAAATCTTGAAGTC ATGAAAG CTTCTGTTATTGATGGATTAACCC CT AGTAAATCAATC 
AGTGGTTTAACAGGCGGTGATGCTGTCAAGATGGATCAATATTTACAATCAGGAAAAACTATTTCAGATACCACAATCCTAGCTGCCGTTAGGAAT 
GCTATGGCTGTTAATGAGTTAAATGCTAAGATGGGACTGGTCTGTGCAACACCAACTGCAGGTAGTGCAGGATGTTTACCAGCTGTGATTTCTACA 
GCCATTGAAAAGCTTAATTTAACAGAAGAAGAGCAACTTGATTTTCTATTTACAGCCGGCGCATTTGGTCTCGTCATTGGTAATAATGCCTCTATC 
TCAGGTGCAGAAGGAGGTTGCCAAGCTGAAGTTGGGTC7\GCTAGTGCTATGGCTGCGGCTGCTTTAGTTATGGCTGCTGGAGGTACTCCTTTCCAA 
GCTAGCCAAGCTATAGCATTTGTTATTAAAAATATGCTTGGACTTATCTGTGACCCTGTTGCAGGTTTAGTTGAAGTCCCTTGTGTGAAGCGGAAT 
GCTCTTGGATCAAGTTTTGCACTTGTTGCTGCTGATATGGCCTTGGCTGGTATTGAATCGCAAATTCCAGTAGATGAAGTTATTGATGCAATGTAT 
CAAGTTGGATCAAGTTTACCGACTGCTTTTCGTGAGACTGCAGAAGGAGGACTTGCTGCGACGCCGACAGGAAGACGTTATAGTAAAGAAATTTTT 
GGGGAA 

SEQ ID NO. 26 

MFYTIEELVEQANSQHKGNIAELMIQTEIEMTGRSREEIRYIM 

AMAVNEIiNAKMGLVCATPTAGSAGCIiPAVI STAI EKLNLTEEEQLDFLFTAGAFGIjVIGNNAS I SGAEGGCQAEVGSASAMAAAAIiVMAAGGTPFQ 
ASQAI AF VI KNMLGLI CDPVAGLiVEVPCVKRNALGS S FALVAADMALiAGI ES QI PVDE VIDAMYQVGSSLPTAFRETAEGGLAATPTGRRYSKE I F 
GE 

SEQ ID NO. 27 

ATGAGCGTATATGTTAGTGGAATAGGAATTATTTCTTCTTTGGGAAAGAATTATAGCGAGCATAAACAGCATCTCTTCGACTTAAAAGAAGGAATTT 
CTAAACATTTATATAAAAATCACGACTCTATTTTAGAATCTTATACAGGAAGCATAACTAGTGACCCAGAGGTTCCTGAGCAATACAAAGATGAGAC 
ACGTAATTTTAAATTTGCTTTTACCGCTTTTGAAGAGGCTCTTGCTTCTTCAGGTGTTAATTTAAAAGCTTATCATAATATTGCTGTGTGTTTAGGG 
ACCTCACTTGGGGGAAAGAGTGCTGGTCAAAATGCCTTGTATCAAT 

TTTACCATATTGCTGATGAATTGATGGCTTATCATGATATTGTGGGAGCTTCGTATGTTATTTCAACCGCCTGTTCTGCAAGTAATAATGCCGTAAT 
ATTAGGAACACAATTACTTCAAGATGGCGATTGTGATTTAGCTATTTGTGGTGGCTGTGATGAGTTAAGTGATATTTCTTTAGCAGGCTTCACATCA 
CTAGGAGCTATTAATACAGAAATGGCATGTCAGCCCTATTCTTCTGGAAAAGGAATCAATTTGGGTGAGGGCGCTGGTTTTGTTGTTCTTGTCAAAG 
ATCAGTCCTTAGCTAAATATGGAAAAATTATCGGTGGTCTTATTACTTCAGATGGTTATCATATAACAGCACCTAAGCCAACAGGTGAAGGGGCGGC 
ACAGATTGCAAAGCAGCTAGTGACTCAAGCAGGTATTGACTACAGTGAGATTGACTATATTAACGGTCACGGTACAGGTACTCAAGCTAATGATAAA 
ATGGAAAAAAATATGTATGGTAAGTTTTTCCCGACAACGACATTGATCAGCAGTACCAAGGGGCAAACGGGTCATACTCTAGGGGCTGCAGGTATTA 
TCG AATTG ATTAATTGTTTAG CGGCAATAGAGGAAC AGACTGTAC C AG CAACTAAAAATGAGATTGGGATAGAAGGTTTTCC AGAAAATTTTGTCTA 
TCATCAAAAGAGAGAATACCCAATAAGAAATGCTTTAAATTTTTCGTTTGCTTTTGGTGGAAATAATAGTGGTGTCTTATTGTCATCTTTAGATTCA 
CCTCTAGAAACATTACCTGCTAGAGAAAATCTTAAAATGGCTATCTTATCATCTGTTGCTTCCATTTCTAAGAATGAATCACTTTCTATAACCTATG 
AAAAAGTTGCTAGTAATTTCAACGACTTTGAAGCATTACGCTTTAAAGGGGCTAGACCACCCAAAACTGTCAACCCAGC^CAATTTAGGAAAATGGA 
TGATTTTTCCAAAATGGTTGCCGTAACAAC^GCTCAAGCACTAA 

TTTACAACACTTTCTGGACCU^GTTGAGGTTGTTGAAGGTATTGAAAAGCAAATCACAACAGAAGGATATGCACATGTTTCTGCTTCACGATTCCCGT 
TTACAGTAATGAATGCAGCAGCTGGTATGCTTTCTATCATTTTTAAAATAACAGGTCCTTTATCTGTCATTTCGACAAATAGTGGAGCGCTTGATGG 
TATACAATATGCCAAGGAAATGATGCGTAACGATAATCTAGACTATGTGATTCTTGTTTCTGCTAATCAGTGGACAGACATGAGTTTTATGTGGTGG 
CAACAATTAAACTATGATAGTCAAATGTTTGTCGGTTCTGATTATTGTTCAGCACAAGTCCTCTCTCGTCAAGCATTGGATAATTCTCCTATAATAT 
TAGGTAGTAAACAATTAAAATATAGCCATAAAACATTCACAGATGTGATGACTATTTTTGATGCTGCGCTTCAAAATTTATTATCAGACTTAGGACT 
AACCATAAAAGATATCAAAGGTTTCGTTTGGAATGAGCGGAAGAAGGCAGTTAGTTCAGATTATGATTTCTTAGCGAACTTGTCTGAGTATTATAAT 
ATGCCAAACCTTGCTTCTGGTCAGTTTGGATTTTCATCTAATGGTGCTGGTGAAGAACTGGACTATACTGTTAATGAAAGTATAGAAAAGGGCTATT 
ATTTAGTCCTATCTTATTCGATCTTCGGTGGTATCTCTTTTGCTATTATTGAAAAAAGG 
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SEQ ED NO. 28 

MSVYVSGIGIISSLGKNYSEHKQHLFDLKEGISKHLYK 
TSLGGKSAGQNALYQFEEGERQVDASLLEKASVYHIADELMAYHD^ 

LGAINTEMACQPYS SGKGINLGEGAGFWLVKDQSLAKYGKI IGGLITSDGYHI TAPKPTGEGAAQI AKQLVTQAGIDYSE IDYINGHGTGTQANDK 
MEKNMYGKFFPTTTLI S STKGQTGHTLGAAGI I ELINCLAAI EEQTVPATKNE IGIEGFPENFVYHQKRE YP IRNALNFSFAFGGNNSGVLLSSLDS 
PLETLPARENLKMAILSSVASISKNESLSITYEKVASNFNDFEALRFKGARPP 
FTTLSGPVEVVEGIEKQITTEGYAHVSASRFPFTVMNA 

QQLNYDSQMFVGSDYCSAQVLSRQALDNSPI ILGSKQLKYSHKTFTDVMTI FDAALQNLLSDLGLTI KD I KGFVWNERKKAVSSDYDFLANLSEYYN 
MPNIiASGQFGFSSNGAGEELDYTVNESIEKGYYLVLSYSIFGGISFAIIEKR 

SEQ ID NO. 29 

ATGAAAATAGATGACCTAAGAAAAAGCGACAATGTTGAAGATCGTCGCTCCAGTAGCGGAGGTTCATTCTCTAGCGGAGGAAGTGGATTACCGATT 
CTTCAACTTTTATTGCTGCGAGGGAGTTGGAAAACCAAGCTTGTGGTTTTAATCATCTTACTGCTACTTGGCGGAGGGGGACTAACCAGCATTTTT 
AATGACTCATCCTCAC CTTCTAGTTAC C AATCTCAGAATGTCTCACGTTCTGTTG ATAATAGCGCAACGAGAGAACAAATCGATTT CGTTAATAAA 
GTCCTTGGCTCAACTGAGGATTTCTGGTC^CAAGAATTCCAAA 

ATTCAAACAGGTTGTGGTATAGGTGAAT CTGCTTCAGGACC ATTTTATTGTTCAGCAGATAAAAAAAT CTAT CTTGATATTTCTTTTTACAATGAA 
TTATCACATAAATATGGTGCTACTGGTGATTTTGCTATGGCCTACGTCATCGCCCACGAAGTTGGTCACCACATTCAAACAGAGTTAGGCATTATG 
GATAAGTATAATAGAATGCGACACGGACTTACTAAGAAAGAAGCAAATGCTTTAAATGTTCGGCTAGAACTTCAAGCAGATTATTATGCAGGGGTA 
TGGGCTCACTACATCAGGGGAAAAAATCTCTTAGAACAAGGAGACTTTGAAGAGGCCATGAATGCTGCCCACGCCGTCGGAGACGATACCCTTCAG 
AAAGAAACCTACGGAAAATTAGTGCCTGATAGCTTTACCCATGGAACAGCTGAACAACGCCAACGTTGGTTTAACAAAGGCTTTCAATATGGTGAC 
ATCCAACACGGTGATACTTTCTCCGTAGAACATCTA 

SEQ ID NO. 30 

MKIDDLRKSDWEDRRSSSGGSFSSGGSGLPILQLLIiLRGSWCT 
VLGSTEDFWSQEFQTQGFGOTKEPKLVLYTNSIQTGCGIGESASGPF 

DKYNRMRHGLT KKE ANAIiNVRLE LQAD YYAGVWAHY I RGKNLIiEQGDFE E AMNAAHAVGDDTLQKET YGKLVPD S FTHGTAEQRQRWFNKGFQYGD 
IQHGDTFSVEHL 

SEQ ID NO. 31 

ATGAAAAGATTACATAAACTGTTTATAACCGTAATTGCTACATTAGGTATGTTGGGGGTAATGACCTTTGGTCTTCCAACGCAGCCGCAAAACGTA 

ACG C CG AT AGTAC ATGCTGATGT C AATT CATCTGTTGATACGAGC CAGGAATTTCAAAATAATTTAAAAAATGCTATTGGT AAC CTAC C ATTTCAA 

TATGTTAATGGTATTTATGAATTAAATAATAATCAGACAAATTTAAATGCTGATGTCAATGTTAAAGCGTATGTTCAAAATACAATTGACAATCAA 

CAAAGACTATCAACTGCTAATGCAATGCTTGATAGAACCATTCGTCAATATCAAAATCGCAGAGATACCACTCTTCCCGATGCAAATTGGAAACCA 

TTAGGTTGGCATCAAGTAGCTACTAATGACCATTATGGACATGCAGTCGACAAGGGGCATTTAATTGCCTATGCTTTAGCTGGAAATTTCAAAGGT 

TGGGATGCTTCCGTGTCAAATCCTCAAAATGTTGTCACACAAACAGCTCATTCCAACCAATCAAATCAAAAAATCAATCGTGGACAAAATTATTAT 

GAAAGCTTAGTTCGTAAGGCGGTTGACCAAAACAAACGTGTTCGTTACCGTGTAACTCCATTGTACCGTAATGATACTGATTTAGTTCCATTTGCA 

ATGCACCTAGAAGCTAAATCACAAGATGGCACATTAGAATTTAATGTTGCTATTCCAAACACACAAGCATCATACACTATGGATTATG 

GAAATAACACTAAAT 

SEQ ID NO. 32 

MKRLHKLFITVIATLGMLGVMTFGLPTQPQNWPIVHADV^ 
QRLSTANAMLDRTIRQYQNRRDTTIiPDANWKPLGWHQVATN^ 

E S LVRKAVDQNKRVRYRVT P LYRNDTDLVPFAMHLEAKS QDGTLE FNVAI PNTQAS YTMD YATGE I TIjN 

j 

SEQ ID NO. 33 

ATGAGTAAACGACAAAATTTAGGAATTAGTAAAAAAGGAGCAATTATATCAGGGCTCTCAGTGGCACTAATTGTAGTAATAGGTGGCTTTTTATGG 
GTACAATCTCAACCTAATAAGAGTGCAGTAAAAACTAACTACAAAGTTTTTAATGTTAGAGAAGGAAGTGTTTCGTCCTCAACTCTTTTGACAGGA 
AAAGCTAAGGCTAATCAAGAACAGTATGTGTATTTTGATGCTAATAAAGGTAATCGAGCAACTGTCACAGTTAAAGTGGGTGATAAAATCACAGCT 
GGTCAGCAGTTAGTTCAATATGATACAACAACTGCACAAGCAGCCTACGACACTGCTAATCGTCAATTAAATAAAGTA 

CTAAAGACAACAGGAAGTCTTCCAGCTATGGAATCAAGTGATCAATCTTCTTCATCATCACAAGGACAAGGGACTCAATCGACTAGTGGTGCGACG 

AATCGTCTACAGCAAAATTATCAAAGTCAAGCTAATGCTTCATACAACCAACAACTTCAAGATTTGAATGATGCTTATGCAGATG 

GTAAATAAAGCACAAAAAGCATTGAATGATACTGTTATTACAAGTGACGTATCAGGGACAGTTGTTGAAGTTAATAGTGATATTGATCCAGCTTCA 

AAAACTAGTCAAGTACTTGTCC^TGTAGCAACTGAAGGTAAACTCCAAGTACAAGGAACGATGAGTGAGTATGATTTGGCTAATGTTAAAAAAGAC 

CAGGCTGTTAAAATAAAAT CTAAGGTCTATCCTGACAAGGAATGGGAAGGTAAAATTT CATATATCTCAAATT AT C CAG AAGCAGAAGCAAACAAC 

AATGACTCTAATAACGGCTCTAGTGCTGTAAATTATAAATATAAAGTAGATATTACTAGCCCTCTCGATGCATTAAAACAAGGTTTTACCGTATCA 

GTTGAAGTAGTTAATGGAGATAAGC^CCTTATTGTCCCTACAAGTTCTGTGATA 

AATCGTAAAATTT C CAAAGTTGAAGTCAAAATTGGTAAAGCTGATGCTAAGACACAAGAAATTTTATCAGGTTTGAAAGCAGGACAA CGTGGTT 
ACTAATCC^GTAAAACCTTCMGGATGGGCAAAAAATTGATAATATTGAATCAATCGATCTTAACTCTAATAAGAAATCAGAGGTGAAA 

SEQ ID NO. 34 
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MS KRQNLGI SKKGAI I SGL S VAL I WIGGFLWVQS Q PNKS AVKTNYKVFNVREGS VS S STLLTGKAKANQEQ YVYFDANKGNRATVTVKVGDKI TAG 
QQLVQYDTTTAQAAYDTANRQLNK^ARQINNLKTTGSIiPAMESSDQS S S S SQGQGTQSTSGATNRLQQNYQSQANASYNQQLQDLlSrDAYADAQAEVN 
KAQKALmDTVITSDVSGTVVEVNSDIDPASKTSQVLVHVATEG 

NNGSSAVNYKYKVD ITS PLDALKQGFTVSVE WNGDKHLI VPTS S VINKDNKHFVWVYITOSNRKISKVEVKIGKADAKTQEILSGLKAGQIVVTNPS 
KTFKDGQKIDNIESIDLNSNKKSEVK 

SEQ ID NO. 35 

ATGAAAAAAATTGGAATTATTGTCCTCACACTACTGACCTTCTTTTTGGTAT 

TCTAAAATGCCTAAAATTGAAGGCTTCACCTATTATGGAAAAATTCCTGAAAATCCGAAAAAAGTAATTAATTTTACATATTCTTACACTGGGTAT 
TTATTAAAACTAGGTGTTAATGTTTCAAGTTACAGTTTAGACTTAGAAAAAGATAGCCCCGTTTTTGGTAAACAACTGAAAGAAGCTAAAAAATTA 
ACTGCTGATGATACAGAAGCTATTGCCGCACAAAAACCTGATTTAATCATGGTTTTCGATCAAGATCCAAACATCAATACTCTGAAAAAAATTGCA 
CCAACTTTAGTTATTAAATATGGTGCACAAAATTATTTAGATATGATGCCAGCCTTGGGGAAAGTATTCGGTAAAGAAAAAGAAGCTAATCAGTGG 
GTTAGCGAATGGAAAACTAAAACTCTCGCTGTCAAAAAAGATTTACACCATATCTTAAAGCCTAACACTACTTTTACTATTATGGATTTTTATGAT 
AAAAATATCTATTTATATGGTAATAATTTTGGACGCGGTGG AGAACTAATCTATGATTC ACTAGGTTATG CTGC CC CAGAAAAAGT CAAAAAAGAT 
GTCTTTAAAAAAGGGTGGTTTACCGTTTCGCAAGAAGCAATCGGTGATTACGTTGGAGATTATGCCCTTGTTAATATAAACAAAACGACTAAAAAA 
GCAGCTTCATCACTTAAAGAAAGTGATGTCTGGAAGAATTTACCAGCTGTCAAAAAAGGGCAC^TCATAGAAAGTAACTACGACGTGTTTTATTTC 
TCTGACCCTCTATCTTTAGAAGCTCAATTAAAATCATTTACAAAGGCTATCAAAGAAAATACAAAT 

SEQ ID NO. 36 

MKKIGIIVLTLLTFFLVSCGQQTKQESTKCT 

GKQLKEAKKLTADDTEAIAAQKPDLIMVFDQDPNINTLKKIAPT^^ 

KDLHHILKPNTTFTIMDFYDKNIYLYGNNFGRGGELIYDSLGYA^ 

SSLKESDVWKNLPAVKKGHIIESNYDVFYFSDPLSLEAQIiKSFTKAIKENTN 

SEQ ID NO. 37 

ATGAAAGTGAAAAATAAGATTTTAACGATGGTAGCACTTACTGTCTTAACATGTGCTACTTATTCATCAATCGGTTATGCTGATACAAGTGATAAGA 

ATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGAACTAGACdAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATC 

GAGTTCATCAAGTGAAC GAGAAACAAATCCGTCAACTAAT CCACCTACAACAGAACCATCGCAACCCT CACCTAGTGAAGAGAACAAGCCTGATGGT 

AGAACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAAGCAAGTA 

GTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGA^ 

AGGAGATAGCCACTCAGATACTGTAATAGCATCTACGGGAGGGATTATTCTGTTATCATTAAGTTTTTAGAATAAGAAAATGAAACTTTAT 

SEQ ID NO. 38 

MKVKNKI LTMVALTVLTCAT YS S IGYADTSDKNTDTS WTTTLSEEKRSDELDQS STGS S SENE S S S SSEPETNPSTNPPTTEPSQPSPSEENKPDG' 
RTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKANDGKKGHSKPKKELPKTGDSHSDTVIASTGGIILLSLSFY^ 



SEQ ID NO. 39 

ATGAAAAGGATACGGAAAAGCCTTATTTTTGTTCTCGGAGTAGTTACCCTAATTTGCTTATGTGCTTGTACTAAACAAAGCCAGCAAAAAAATGGCT 
TGTCAGTAGTGACTAGCTTTTATCCAGTATATTCCATTACAAAAGCAGTTTCTGGTGATTTGAATGATATTAAAATGATTCGATCACAGTCAGGTAT 
TCATGGTTTTGAACCCTCATCAAGTGATGTTGCTGCCATTTATGATGCTGATCTATTTCTTTATCATTCGCACACACTAGAAGCTTGGGCGAGACGT 
TTGGAACCTAGTTTGCATCACTCTAAAGTATCTGTAATTGAAGCTTCAAAAGGTATGACTTTGGATAAAGTTCATGGCTTAGAAGATGTAGAGGCAG 
AAAAAGGAGTAGATGAGTCAACCTTGTATGACCCTCACACTTGGAATGACCCTGTAAAAGTATCTGAGGAAGCAC^CTCATCGCTACAC^ATTAGC 
TAAAAAGGATCCTAAAAACGCTAAGGTTTATCAAAAAAATGCTGATCAATTTAGTGACAAGGCAATGGCTATTGCAGAGAAGTATAAGCCAAAATTT 
AAAGCTGCAAAGTCTAAATACTTTGTGACTTCACATACAGCATTCTCATACTTAGCTAAGCGATACGGATTGACTCAGTTAGGTATTGCAGGTGTCT 
CAACCGAGC^GAACCTAGTGCTAAAAAATTAGCCGAAATTCAGGAGTTTGTGAAAACATATAAGGTTAAGACTATTTTTGTTGAAGAAGGAGTCTC 
ACCTAAATTAGCTCAAGCAGTAGCTTCAGCTACTCGAGTTAAAATTGCAAGTTTAAGTCCTTTARAAGCAGTTCCCAAAAACAATAAAGATTACTTA 
GAAAATTTGGAAACTAATCTTAAGGTACTTGTCAAATCGTTAAATCAATAG 

SEQ ID NO. 40 

MKRIRKSLIFVLGVVTLICLCACTKQSQQKNGLSVVTSFYPVYSITKAVSGDLNDIKM 
LEPSLHHSKVSVIEASKGMTLDKVHGLEDVEAEKGVDESTLYDPHTWNDPVKVSEEAQLIATQLAKKDPKN 

KAAKS KYF VTSHTAFS YLAKRYGLTQLG I AGVS TEQE PS AKICIiAE I QE F VKT YKVKT I F VE EGVS P KIiAQAVAS ATRVKIAS L S PLXAVPKNNKD YL 
ENLETNLKVLVKSLNQ 



SEQ ID NO. 41 

ATGCCTAAGAAGAAATCAGATACCCCAGAAAAAGAAGAAGTTGTCTTAACGGAATGGCAAA 

ATGAAGAAGAACAAAAACGTATTAACGAAAAATTACG CTTAGATAAAAGAAGTAAATTAAATATTTCTT CT CCTGAAGAAC CTCAAAATACTACTA 

AATTAAGAAGCTTCATTTTCCAAAGATTTCAAGACCTAAGATTGAAAAGAAACAGAAAAAAGAAAAAATAGTCAACAGCTTAGCC^AAACTAATCGC 

ATTAGAACTGCACCTATATTTGTAGTAGCATTCCTAGTCATTTTAGTTTCGGTTTTCCTACTAACTCCTTTTAGTAAGCAAAAAACAATAACAGTTA 

GTGGAAATCAGCATACACCTGATGATATTTTGATAGAGAAAACGAATATTCAAAAAAACGATTATTTCTTTTCTTTAATTTTTAAAC^TAAAGCTA 

TGAACAACGTTTAGCTGCAGAAGATGTATGGGTAAAAACAGCTCAGATGACTTATCAATTTCCCAATAAGTTTCATATTCAAGTTCAAGAAAATAAG 
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ATTATTGCATATGCACATACAAAGCAAGG 

TCTTAACAATTAACCTTGATAAGGAAGATAGTATTAAGCTATTAATTAAAGATTTAAAGGCTTTAGACCCTGATTTAATAAGTGAGATTCAGGTGAT 

AAGTTTAGCTGATTCTAAAACGACACCTGACCTCCTGCTGTTAGATATGCACGATGGAAATAGTATTAGAATACCATTATCTAAATTTAAAGAAAGA 

CTTCCTTTTTACAAACAAATTAAGAAGAACCTTAAGGA^ 

CCCCTGTTAAAGCAGAAGATACAAAAAATAAATCAACT 

CTGAAA.TACTAATCAACAAGGACAACAGATAGCAACAGAGCAGGCACCTAACCCTCAAAATGTTAAT 

SEQ ED NO. 42 

MPKKKSDTPEKEEVVIiTEWQKRNLEFDKI^ 

IRTAP IFWAFIiVI LVSVFLLTPFSKQKT ITVSGNQHTPDDI LI EKTNIQKNDYFFSIiI FKHKAI EQRLAAEDVWVKTAQMTYQFPNKFHI QVQENK 

IIAYAHTKQGYQPVIiETGKKADPWSSELPKHFLT^ 

LPFYKQIKKITLKEPSIVDMEVGVYTTTNTIEST^ 
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